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lift (54) Title: IDENTIFICATION AND VALIDATION OF NOVEL TARGETS FOR AGROCHEMICALS 
0C 

^ (57) Abstract: The invention relates to a method for identifying and validating plant targets for agrochemicals, comprising the steps 
2 of determining gene or protein expression profiles in function of the progression of an essential biological process in a plant, and the 
subsequent downregulation of expression of said gene or protein in a plant cell. More particularly, the effects of downregulation of the 
O candidate target gene were directly monitored on plants locally infected with a vector mediating viral induced gene suppression in that 
^ infected plant area. The invention also relates to isolated plant genes encoding proteins involved in plant growth and development. 
^ The invention also relates to plants tolerant to agrochemicals such as herbicides or pesticides. 
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IDENTIFICATION AND VALIDATION OF NOVEL TARGETS FOR 

AGROCHEMICALS 

The invention relates to isolated plant genes encoding proteins essential for plant growth and 
development and to methods for identifying and validating these genes/proteins as target 
5 genes/proteins for agrochemicals, such as herbicides. A target for an agrochemical is a gene 
or a protein where the agrochemical interferes with when applied to the target organism. 

For the identification and validation of useful agrochemicals, the agrochemical industry 
traditionally relied on in vivo screening methods wherein chemical compounds were brought 

10 into direct contact with the living target organisms (e.g. plants for herbicide screening, insects 
for insecticide screening, etc.). However due to (i) the dramatic increase in the number of 
compounds that need to be screened to find a successful new agrochemical product, and (ii) 
the need to rely on very small quantities of compound such as are available in a combinatorial 
chemistry based compound libraries, and (iii) the need to identify compounds with a novel 

15 mode of action, the industry has developed a considerable interest in using more efficient and 
faster in vitro screening methods. 

To render such in vitro screening methods more successful, it is essential to carefully sekot 
the tested target gene/proteins and/or the tested agrochemicals. It has been described th:- a 

20 more practical in vitro approach for finding new agrochemicals would involve identification of 
target genes/proteins against which the agrochemical compounds could possibly work. For this 
process identification of suitable target genes/proteins, the conventional methods make use of 
gene knock-outs of the target organism. Gene knock-out libraries are generally made as a 
random collection of thousands of gene knock-outs.ln these methods it is investigated if the 

25 gene/protein is essential for the growth and/or viability of the organism, since the knockout of 
an essential gene (when present in a homozygous state) leads to a lethal or otherwise 
detrimental effect on the organism. The indication that said gene/protein is essential to the 
organisms makes it a suitable target for an agrochemical. These conventional methods are still 
cumbersome and time consuminy because uf the use of gene-knockouts. Other techn i quoo - 

30 that are useful to estimate the essential character of a gene or its corresponding protein are 
based on the downregulation of said gene or protein for example via anti-sense expression 
technology (WO01 07601 ). 

To render an in vitro screening for agrochemicals more successful, it is essential to carefully 

35 select the tested target gene/proteins. Therefore a more practical in vitro approach for finding 

new agrochemicals could be a multistep process involving the steps of (1) identification of 

target genes/proteins against which the agrochemical compounds could possibly work, (2) 

1 
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validation of the candidate target gene as being an essential gene/protein for the organism and 
(3) use of these target genes/proteins in an in vitro screening procedure in which the chemical 
compounds are tested. 

It is the aim of the present invention to develop a process for the more efficient identification of 
5 candidate target genes/proteins for agrochemicals, combined with the more efficient validation 
of the target genes/proteins. It is a further aim of the invention to provide this process in order 
to design more efficiently the screening procedure with the agrochemical compound. 

The method of the present invention is based on the direct use of genetic information for 
10 example generated by expression profiling of the candidate target genes/proteins, for the 
identification and the validation of the targets. 

Therefore according to a first embodiment of the present invention, there is now provided a 
method for identifying and validating plant genes/proteins as targets for agrochemicals, said 
1 5 method comprising the steps of: 

a. determining gene or protein expression profiles during a biological process of a plant or 
plant cell, said biological process being necessary for the viability or the growth of the 
plant or plant cell; 

b. selecting genes or proteins having altered expression during said biological process, 
20 c. cloning said selected gene or the nucleic acid encoding said protein in its full-length or 

partial form, 

d. incorporating said nucleic acid in a vector designed for downregulation of expression of 
said nucleic acid or the sequence homologous to said nucleic acid in a plant or plant 
cell. 

25 

The aim of methods of the present invention is the identification of target gene(s)/protein(s) out 
of a broad range of candidate plant genes/proteins. The identification step is achieved by the 
techniques of expression profiling described in the following embodiments. Since the method 
of the present invention can be used for identification of genes/proteins or proteins, the term 

30 "target" as used herein can mean a gene as well as a gene product, namely a protein, 
polypeptide or peptide. With the. expression "target for an agrochemical" is meant a protein as 
well as a gene or nucleic acid encoding such protein, and when such target is inhibited, 
stimulated or otherwise disrupted in its normal activity by an agrochemical compound, this 
would lead to a desired effect in a target organism. The invention aims at efficiently identifying 

35 targets for agrochemicals. Said agrochemicals can be herbicides or pesticides as well as 
growth stimulators or growth regulators. 
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Target identification means selecting candidate targets from a larger number of genes/proteins 
or proteins on the basis of certain properties that give such a molecule a higher probability of 
being a suitable target than other molecules which do not exhibit said properties. 
A herbicide target is a protein or gene that when inhibited, stimulated or otherwise disrupted in 
5 its normal activity by a compound would kill the (weedy) target plant or have a strong negative 
effect on its growth, said compound would therefore be a candidate herbicide. An insecticide 
target is a protein or gene that when inhibited, stimulated or otherwise disrupted in its normal 
activity by a compound would kill the insect pest or have a strong negative effect on its growth, 
said compound would therefore be a candidate insecticide. A plant growth regulator (PGR) 
10 target is a protein or gene that when inhibited, stimulated or otherwise disrupted in its normal 
activity by a compound would promote or alter in a desirable way the growth of plant, said 
compound would therefore be a candidate PGR. 

Nowadays a lot of genomic information, e.g. gene sequences, expression profiles, homologies 
15 and putative functionality, is available from genomic sequencing and expression studies in 
several target organisms. It is therefore of interest to develop a new method to identify and 
validate genes/proteins as candidate targets for agrochemicais, such methods being based on 
a direct use of such genomic information. This use of genomic information, e.g. the expression 
level of a gene, allows the selection of a limited set of appropriate candidate genes/proteins. 
20 Only this limited set of genes is then tested in the validation step, contributing to a higher 
efficiency and success rate of the screening procedure for agrochemicais. Furthermore, the 
genetic information, e.g. the functional data of the putative target gene/protein, is used as a 
basis to design more efficiently the in vitro screening procedure with the agrochemical 
compound(s) under investigation. 

25 

The present invention discloses methods that allow for the identification and validation of target 
genes/proteins for agrochemicais out of the broad range of possible genes/proteins and 
proteins. It therefore allows genes or proteins to be selected for the development of suitable in 
vitro screening methods for the screening of novel and efficient agrochemicais. 

30 According to a first step of the methods of the present invention target genes or gene products 
are identified by using transcript profiling of the genomic content of a cell. By using this 
technique one immediately obtains genomic data (sequences and expression level) as well as 
a functional indication of the candidate target gene or gene product. Thus this method is useful 
for a first identification and selection of possible agrochemical target genes/proteins, since it 

35 provides as a bonus genomic and functional data on the candidate target. A good candidate 
target gene is a gene of which the expression varies significantly over the course of an 
essential biological process of the cell, since that is an indication that the gene/protein is 

3 
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involved in that biological process The present application describes for the first time that the 
determination of an expression profile of a gene during the progression of an essential 
biological process is used to identify possible agrochemical targets. 

5 The expression profiling in the target identification steps of the method of the present invention 
is carried out in function of the progression of a process that is essential for plant growth 
and/or plant development and/or plant viability. In one preferred embodiment of the present 
invention, the essential process that is monitored in the target identification step is the process 
of cell division. Accordingly, in a particular embodiment of the invention, the method to identify 
10 target genes/proteins for agrochemicals is based on the transcript profiling of genes/proteins 
that are specifically involved in cell division. Therefore the invention provides a method as 
mentioned above, wherein said biological process cell division. 

Other biological processes that may be monitored for the identification and validation of 
15 agrochemical targets are for instance processes that are essential for seed germination, leaf 
formation, etc. 

The term expression profiling means determining the time and/or place when or where a gene 
or a protein is active. Particularly for a gene, this is achieved by monitoring the level of 
20 transcripts and therefore in the case of gene expression profiling the term transcript profiling or 
mRNA profiling is used. 

Generally, the expression profiling in the methods of the present invention is carried out in 
function of the progression of a process that is essential for plant growth and/or development 
and/or plant viability. To achieve this, the process of interest is synchronized in a sufficient 

25 number of cells (for example in a cell culture) or organisms to allow collecting samples for 
expression profiling representing various stages of said process. Target identification then 
consists in selecting those genes or proteins that show significant changes in expression levels 
in function of the progression of the process of interest. It are those genes or proteins that are 
likely to be strongly involved or to be essential in said process. 

30 The term "essential" means that if the gene or the gene product cannot function as normal in 
the cell or organism, this will have significant implication in the cell growth or cell development 
or other vital functions of the cell or organism. 

According to the invention, the expression profiling can be studied at the level of m-RNA, using 
35 transcript profiling techniques, or alternatively at the level of protein, using proteomics-based 
approaches. 



4 
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in one preferred embodiment of the invention, m-RNA profiling is used for identification of 
target genes/proteins and expression levels may be quantified via techniques that are well 
known to the man skilled in the art. For instance, mRNA-profiling can be performed using 
micro-array or macro-array technologies, this method however requires that the gene 
5 sequences are known (full length sequences or at least partial sequences) and are physically 
available for coating on the micro or macro array surface. Standard chips are being 
commercialised for Arabidopsis, and sufficient sequence information is now available for 
different plant species (including rice) to allow sufficient sequence data for this approach. 
Another approach for mRNA profiling is the use of AFLP-based transcript profiling as 
10 described in example 1. In this approach short sequence tags are monitored. In a next step 
these short sequence tags may be matched with full-length genes/proteins if required. Gene or 
protein selection thus be based on either full-length or partial sequences and it is well within 
the realm of the person skilled in the art to find a full length sequence based on the knowledge 
of a partial sequence. 

15 Therefore, one aspect of the invention is the direct use of genetic information to select 
candidate targets for agrochemicals. As mentioned above this genetic information can be 
generated by a number of techniques. Accordingly, the present invention encompasses a 
method as mentioned above, wherein the expression profiles are determined by means of 
micro-array, macro array or c-DNA-AFLP. 

20 

According to another embodiment of the invention, proteomic based approaches may be used 
to identify candidate target proteins for agrochemicals. 

It is now demonstrated that for the purposes of identifying a target gene for agrochemicals a 
25 synchronized culture of dividing plant cells is used to isolate samples and to monitor the 
expression of the transcripts of those cells during the progression of the cell division. 

Therefore according to a particular embodiment, the invention also encompasses a method for 
the identification and validation of plant agrochemical targets, wherein said gene or protein 
30 expression profiling is based on nucleic acid or protein samples collected from a synchronized 
culture of dividing plant cells. 

In one embodiment of the invention, the samples used for expression profiling are obtained 
from a synchronized culture of rice cells, tobacco cells, Arabidopsis cells or cells from any 
35 other plant species. The cell culture should be synchronized in order to obtain samples 
containing a sufficient amount of cells that are at the same stage of the biological process, so 
that the various samples taken for expression profiling are representative for the various 

5 
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stages of the essential biological process. In a particular embodiment of the present invention 
the samples are obtained from cells that are synchronized for cell division. In a preferred 
embodiment of the invention expression profiling is done on synchronized dividing cells. 
Certain cell lines are particularly suitable for synchronization of cell division, for instance 
5 synchronization of tobacco Bright Yellow-2 cell lines as described in example 1. Therefore 
most preferably, the synchronized cells are tobacco BY2 cells. By using synchronized tobacco 
BY2 cells and performing a cDNA-AFLP-based genome-wide expression analysis, the 
inventors built a large collection of plant cell cycle-modulated genes/proteins. Approximately 
1340 periodically expressed genes/proteins were identified, including known cell cycle control 

10 genes as well as numerous novel genes. A number of plant-specific genes were found for the 
first time to be cell cycle modulated. Other transcript tags were derived from unknown plant 
genes showing homology to cell cycle-regulatory genes of other organisms. Many of the genes 
encode novel or uncharacterised proteins, indicating that several processes underlying cell 
division are still largely unknown. These sequences are presented herein as SEQ ID NO 1 to 

15 SEQ ID NO 785. 

While, according to the invention, the basic criterion for identifying an agrochemical target 
gene or gene product consists in the differential expression levels of the gene or the protein 
observed during the progression of an essential biological progress, secondary selection 
20 criteria can be used and combined with this primary criterion. 

One such secondary criterion may be to make a selection of genes or proteins that are found 
not to exhibit a high degree of homology with genes or proteins from other organisms (such as 
mammals) as this criterion is likely to reduce the probability that the agrochemical compounds 
25 active on the "plant-specific" target genes or gene products would also exhibit toxic effects 
against other organisms, for example mammals. 

Another secondary selection criterion could exist in focussing on a particular phase of the 
essential biological process as mentioned above. For instance, when cell division modulated 
genes/proteins are under investigation as potential agrochemical target genes/proteins, one 

30 could preferably use those cell division modulated genes/proteins which exhibit high 
expression during the G1 phase, S phase, G2 phase or M phase or at the transition stages of 
these phases. In one embodiment of. the present invention, the focus may be on the G2/M 
transition phase, since this phase in the plant cell cycle is considered to have more "plant 
specific" elements than other phases of the cell cycle and is therefore more likely to yield plant 

35 specific candidate target genes and proteins. Whereas the core cell cycle genes/proteins and 
the basic regulatory mechanisms controlling cell cycle progression are conserved among 
higher eukaryotes, basic developmental differences between plants and other organisms imply 
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. that plant-specific regulatory pathways exist that control cell division. Especially for events 

occurring at mitosis, plants are expected to have developed unique mechanisms regulating 

karyo- and cytokinesis. A typical plant cell is surrounded by a rigid wall and can as such not 

divide by constriction. Instead, a new cell wall between daughter nuclei is formed by a unique 

5 cytoskeletal structure called the phragmoplast, whose position is dictated by another 

cytoskeletal array called the preprophase band. Another major difference between plant and 

animal mitosis is found in the structure of the mitotic spindles: in animals, they are tightly 

centred at the centrosome, whereas in plants they have a diffuse appearance. 

Therefore a suitable second criterion to combine with the first criterion may be to select 

10 . genes/proteins that are involved in the mitosis step of the cell cycle and/or that are involved in 

the building of the cell wall during mitosis. 

Likewise a secondary selection criterion to be combined with the first criterion may be the 
selection of genes or proteins from a dicotyledonous plant that do not exhibit a high degree of 
homology with genes or proteins from a monpcotyledonous plant (or vice versa). This 
15 secondary criterion is especially relevant when identifying agrochemical target genes or 
proteins with the intention to selectively identify targets that would allow for subsequence 
screening of selective herbicides or plant growth regulators. For instance, this strategy is 
advantageous to find targets and agrochemicals for selective weed control, such as herbicides 
that kill dicotyledonous weeds in monocotyledonous crops or vice versa. 

20 

Therefore according to further embodiments, the present invention encompasses methods as 
mentioned above, wherein the target gene or protein meets any one or more of the above 
mentioned secondary selection criteria, such as being plant specific, being mitosis specific or 
being dicot specific etc. 

25 

The possibility for combination of criteria used for selecting target genes or proteins renders 
the method of the present invention more powerful than classical methods. According to a 
preferred embodiment the technique of the present invention allows identifying genes/proteins, 
to be used as agrochemical target genes/proteins, these genes being genes/proteins that are 
30 involved in cell division and control of cell cycle progression, and these genes being novel and 
these genes being plant specific. Therefore the method of the present invention is 
characterized in that it allows identifying new and unexpected agrochemical targets. 

In the target gene identification step according to the present invention, genes or proteins are 
35 selected for which there is a high probability of being essential. It should be clear that the 
above-mentioned examples are given by way of illustration and are not meant to be limiting in 
any way. 
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Further, according to a second step in the method of the invention, the candidate 
agrochemical target gene or gene product is subsequently validated as being essential for the 
growth and/or development and/or viability of the organism. This is achieved by cloning the 
5 identified candidate target gene in a vector construct designed to downregulate said target 
gene in a plant or plant cell, followed by inoculating the plant with this construct and monitoring 
whether downregulation of the gene results in negative effects on plant growth and/or 
development and/or viability. A valid target gene is a target gene that causes significant effects 
on growth of plants or plant cells when downregulated. The present application describes for 
10 the first time the use of a particularly fast and efficient downregulation method to validate 
possible agrochemical targets. 

Accordingly, the present invention encompasses a method as mentioned above for the 
identification and validation of plant targets for agrochemicals. wherein said downregulation 
1 5 involves a viral-induced gene silencing mechanism. 

Thus, starting from a number of candidate target genes/proteins identified in the first step of 
the method of the invention, the target validation step aims at confirming and demonstrating 
the essential nature of the gene by demonstrating that severe down-regulation of the 

20 expression level of the gene has a significant effect on the organism. 

In particular, when one is interested in developing a screening assay for herbicides, 
downregulation of the candidate target gene in a plant may result in a lethal effect, a severe 
inhibition of plant growth or any other (obviously) negative phenotypic effects. Alternatively, 
when one is interested in developing a screening assay for plant growth regulators, the effect 

25 of downregulating the target gene may be modulation or even stimulation of growth in general 
or modulation or even stimulation of a particular process associated with plant growth and/or 
development and/or architecture and/or physiology and/or biochemistry or any other 
phenotypic effect. 

30 The man skilled in the art will be aware of various methods to achieve downregulation of a 
given gene or protein, such methods include essentially co-suppression based approaches or 
anti-sense based approaches as well as any other method resulting in gene silencing. Other 
examples of downregulation in a cell are well documented in the art and include, for example, 
RNAi techniques, the use of ribozymes etc. Gene silencing may also be achieved by insertion 

35 mutagenesis (for example, T-DNA insertion or transposon insertion) or by gene silencing 
strategies as described by. among others, Angell and Baulcombe, 1998 (WO 98/36083), Lowe 
et a/., 1989 (WO 98/53083), Lederer et a/., 1999 (WO 99/15682) or Wang ef a/., 1999 (WO 

8 
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99/53050). Expression of an endogenous gene may also be reduced if the endogenous gene 
contains a mutation. 



The effect of gene downregulation can be observed in stably transformed plants which can be 
5 obtained by means of various well known techniques, these techniques generally involving a 
plant transformation step and a plant regeneration step. 

Genes/proteins which exhibit a severe negative effect when downregulated may however 
significantly reduce transformation and/or regeneration efficiency. Therefore, a relevant 
parameter indicative for the essential nature of the gene, may be a severe reduction in 
10 transformation efficiency when said particular gene is used in a down-regulation construct. In 
order to avoid the (negative) effect on transformation efficiency in the transformation and 
regeneration process, an inducible promoter system can be used. Induction of promoter 
activity can then be applied at a later stage (after transformation) in order to observe the effect 
of gene downregulation once the transformed plant or plantlet started to develop. 

15 

Further, another method for testing the effect of downregulation of a target gene, which can be 
used in the methods of the present invention, is based on a rapid transient transformation 
process and does not rely on the somewhat lengthy process of stable transformation. The use 
of this method for target validation in plants is part of this invention, regardless of whether 
20 target identification has been performed according to this invention. 

Accordingly, in a preferred embodiment, the downregulation method is based on co- 
suppression and on rapid transient transfection of plant cells. The preferred method to validate 
genes/proteins as targets for agrochemicals is based on the cloning of the identified candidate 

25 target gene in a vector construct containing a viral replicase that is involved in the very efficient 
downregulation of the candidate target gene in the infected plant or plant cell via the 
mechanism of co-suppression. One advantage of this method for downregulation, is the fact 
that the infection of the host cells or the plant can be performed locally for example by 
inoculating the vector directly on the leaves. This allows a very fast evaluation of the effect of 

30 downregulating the candidate target since no complete transgenic plants have to be 
generated. Also this technique allows an easy way of monitoring the effect of the 
downregulated candidate target by simply looking at the changes of the infected place, for 
example monitoring the lethal effects on the infected leaf). 

35 Therefore in a preferred embodiment, the downregulation method is based on co-suppression. 
In a more preferred embodiment of the invention this co-suppression technique is fast and 
easy to evaluate the effect of downregulation, so that it is suitable for dealing with high 

9 
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numbers of genes/proteins. This can be achieved by using viral induces gene silencing 
mechanisms (VIGS) and by infecting the plant directly and locally, for example on the leaves. 
Therefore, according to another embodiment, the present invention relates to the use of a viral- 
induced gene silencing system for validating plant targets for agrochemicals. 

5 

This method for severe downregulation via transient expression of the gene in the presence of 
certain viral elements is referred to as Virus-induced gene silencing mechanism" (VIGS) and is 
previously described in Ratcliff et a/., Plant J., 25 237 - 245, 2001. Briefly, virus vectors 
carrying host-derived sequence inserts induce silencing of the corresponding genes/proteins in 

10 infected plants. This virus-induced gene silencing is a manifestation of an RNA-mediated 
defence mechanism that is related to post-transcriptional gene silencing in transgenic plants. 
Ratcliff et a/., developed an infectious cDNA clone of Tobacco rattle virus (TRV) that has been 
modified to facilitate insertion of non-viral sequences and subsequent infection in plants. This 
vector mediates VIGS of endogenous genes/proteins in the absence of virus-induced 

15 symptoms. Unlike the other RNA virus vectors that have been used previously for VIGS, the 
TRV construct is able to target most RNA's in the growing points of the plant. A more detailed 
description of this downregulation mechanism is given in example 2. 

According to particular embodiments of the present invention, the VIGS system is applied in 
20 Arabidopsis or in tobacco for the purposes of validation of a candidate agrochemical target 
gene. 

According to a further preferred embodiment, there is provided a method for validation of a 
candidate agriochemical target gene, wherein the gene is downregulated in. a plant via the use 
of infectious DNA of virus is Tobacco Rattle Virus and wherein said plant is tobacco. 

25 

The present invention relates to a combination of the above-mentioned identification and 
validation steps, which are especially selected so that they lead to an efficient selection of 
candidate target genes for agrochemicals. The outcome of the transcript profiling provides the 
necessary information and forms the basis for the second step, namely the validation of the 

30 target gene via incorporation of the gene sequence in the downregulation construct. The 
combination of these two techniques is especially useful for selecting suitable target 
genes/proteins for agrochemicals in a high throughput fashion. This technique thus overcomes 
the technical limitations of previously described techniques such as the knock-out libraries and 
the antisense strategies without genetic information of the genes. This new combination offers 

35 a time-saving strategy for identification of a candidate target gene and the more direct 
information output in the form of a real sequence, the immediate cloning of the gene in the 
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downregulation construct and immediate application of the downregulating construct on the 
target organism. 

The combination of these steps offers the unique opportunity to provide many high quality 
target genes/proteins for agrochemicals in a commercially and economically advantageous 
5 way. Furthermore, inherent to the techniques of the present invention is that the qualified 
target genes/proteins are accompanied with the necessary information to design a suitable in 
vitro screening assay with the agrochemical. This information consists of the expression 
characteristics of the genes/proteins and their function and importance in the essential 
biological process that was monitored during the transcript profiling. 
10 In this way, the methods of the present invention overcome the practical and commercial 
limitations of the existing techniques. 

Once this level of target validation is reached, the validated target can be selected for the 
development of an appropriate high-throughput in vitro screening method, wherein the 

15 agrochemical is tested. Therefore, the present invention also encompasses a method for 
screening candidate agrochemical compounds, comprising the use of any of the identification 
procedures and/or validation procedures as mentioned above. More particularly, the present 
invention encompasses a method for screening agrochemical compounds, comprising the use 
of any one or more of the sequences represented in SEQ ID NO 1 to 785. 

20 Various methods can be used to develop suitable in vitro assays for screening the chemical 
compounds, depending on what is known about the biological activity of the target gene. For 
example, when the target is an enzyme, measurement of the enzymatic activity of the target 
could form the basis of the in vitro screening assay with the chemical compound. 

25 Therefore, the methods of the present invention, the genes/proteins and the information 
generated by the combined identification and validation methods of the present invention, 
allow one to design and/or fine tune a screening for testing and/or developing agrochemicals 
(for example herbicides). For example if the expression pattern and the role of the target gene 
in the essential biological process is known, it is much easier to set up an in vitro screening 

30 assay to monitor the effect of a candidate herbicide on the target cells. Therefore it is expected 
that much more refined and/or efficient herbicides will be characterized using the methods of 
the present invention. 

Also because of the knowledge of its function, one can further design the screened 
35 agrochemical compound to improve its activity for instance to improve its binding capacity to 
the target. 

11 
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Therefore, the present invention encompasses a method for screening candidate agrochemical 
compounds comprising the use of any of the methods as mentioned above. 



The invention may also be applied for the development of agrochemical (for example herbicide 
5 or pesticide) tolerant plants, plant tissues, plant seeds and plant cells. 

Herbicides that exhibit greater potency can also have greater crop phytotoxicity. A solution to 
this problem is to develop crops that are resistant or tolerant to herbicides. Crop hybrids or 
varieties that are tolerant to the herbicides allow, for instance, for the use of herbicides that kill 
weeds without attendant risk of damaging the crop. Further it should be clear that when a plant 
10 is overexpressing the target of a particular herbicide, the tolerance of said plant against said 
herbicide will also be enhanced. 

Therefore the present invention also relates to the use of the agrochemical (e.g. herbicide) 
target genes/proteins as identified by the method of the present invention for generating 
15 transgenic plants that are tolerant or resistant to an agrochemical (e.g. herbicide). Example of 
genes and gene sequences identified by the combined identification and validation methods of 
the present invention and which can be used as agrochemical target or that can be used to 
obtain herbicide tolerant plants comprise the sequences as represented in any of SEQ ID NOs 
1 to 785. 

20 These sequences are derived from tobacco, but the one skilled in the art can easily find via 
homology search in databases or homology search in a cDNA library the homologues genes of 
other plant species, for instance monocot sequences (e.g the corresponding rice or corn 
sequence), and use them for the same purposes as described herein. These homology 
searches can be done for example with a BLAST program (Altschul et al. t Nucl. Acids Res., 25 

25 3389 - 3402, 1997) on a sequence database such as the GenBank database. Homology 
studies as referred to above can be performed using sequences present in public and/or 
proprietary databases and using several bioinformatics algorithms, well known to the man 
skilled in the art. Methods for the alignment of sequences are well known in the art, such 
methods include GAP, BESTFIT, BLAST, FASTA and TFASTA. GAP uses the algorithm of 

30 Needleman and Wunsch (J. Mol. Biol. 48: 443-453, 1970) to find the alignment of two 
complete sequences that maximizes the number of matches and minimizes the number of 
gaps. The BLAST algorithm calculates percent sequence identity and performs a statistical 
analysis of the similarity between the two sequences. The software for performing BLAST 
analysis is publicly available through the National Centre for Biotechnology Information. 

35 

Further, some of the tobacco sequences identified by the method of the present invention 
might be partial but again, the full-length sequence can easily be found based on the partial 
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sequence. For example "transcript building" can be done based on homology search on EST 
databases, cDNA's or gene predictions. These databases and programs are publicly available 
e.g. http://www.tiar. org/ . 

Therefore the present invention relates to the use of the nucleic acids as identified and 
5 disclosed herein and represented in SEQ ID NO 1 to 785, and also to the use of the full length 
genes regenerated from the partial sequences as well as to the use of the homologues 
sequences isolated from the same or from other plants. 

In another embodiment, the present invention relates to a nucleic acid identified according to 
10 the method of the invention. Thus the invention encompasses an isolated nucleic acid 
identifiable by any of the methods as mentioned above. 

In another embodiment, the invention relates to a nucleic acid identified according to the 
method of the invention, comprising the nucleic acid sequence chosen from the group of SEQ 
15 ID NO 1 to 785 or a full length sequence thereof, or a functional homologue thereof, or a 
functional fragment thereof, or an immunologically active fragment thereof. Thus the invention 
encompasses an isolated nucleic acid, comprising at least part of a nucleic acid sequence 
chosen from the group of SEQ ID NO 1 to 785 a homologue, functional fragment or derivative 
thereof. 

20 With "a functional fragment" is meant any part of the sequence that is responsible for the 
biological function or for an aspect of the biological function of the nucleic acid sequence. 

Further, the invention encompasses a method for the production of an agrochemical resistant 
plant, comprising the use of any one or more of SEQ ID NO 1 to 785 or a homologue, 
25 functional fragment or derivative thereof or one or more of the proteins encoded by SEQ ID NO 
1 to 785 or a homologue, functional fragment or derivative thereof. 

In one embodiment of the present invention the sequences, the full-length sequences and the 
homologues are used to develop herbicide tolerant plants. 



30 



35 



Further the invention encompasses a plant tolerant to an agrochemical, in which the 
expression level of one or more of the nucleic acids corresponding the SEQ ID NO 1 to 785 or 
the homologue, functional fragment or derivative thereof, is modulated. Further the invention 
encompasses any part or more preferably any harvestable part of these plants. 

Therefore the invention also relates to the use of these sequences, the full-length sequences 
and the homologues as targets for agrochemicals The invention encompasses the use of a 
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nucleic acid as mentioned above or the protein encoded by said isolated nucleic acid as a 
target for an agrochemical compound, preferably, wherein the agrochemical compound is a 
herbicide. 

5 Further, the invention relates to the use of these sequences to develop screening assays for 
the identification and/or development of agrochemicals. The invention encompasses a method 
for screening candidate agrochemical compounds comprising the use of any one or more of 
SEQ ID NO 1 to 785 or a homologue, functional fragment or derivative thereof or one or more 
of the proteins corresponding to SEQ ID NO 1 to 785 or a homologue, functional fragment or 
1 0 derivative thereof. 

The present invention will be further illustrated by the following figures, wherein, 
Figure 1 shows the gene expression profiles obtained by quality-based clustering of all 
transcript tags monitored in a transcript profiling experiment as described in example 1. Shown 
15 are the trend lines of 16 clusters containing 97% of the genes and covering the entire time 
course as indicated on top. S-phase-specific gene clusters are grouped in A, gene clusters 
with peak expression between S- and M-phase are grouped in B, whereas group C contains 
the M- and G1 -phase-specific clusters. D: Three small clusters of genes with peak expression 
during two cell cycle phases. 

20 

Figure 2 shows the phenotypes of tobacco plants inoculated with a acetolactate synthase 
(SEQ ID NO 18) downregulation construct and phenotypes of tobacco plants inoculated with a 
prohibitin (SEQ ID NO 21) downregulation construct. The phenotypes were observed 12 days 
after inoculation (upper panel) or 1 7 days after inoculation (lower panel). 

Figure 3 shows the phenotype of tobacco plants inoculated with a B-type CDK (SEQ ID NO 
11) donwregulation contruct. The observations were made 37 days after inoculation. 

Figure 4 shows the sequences identified by the methods of the present invention and 
30 represented by SEQ ID NO 1 to SEQ ID NO 785 

EXAMPLES 
Example 1 

A cDNA-AFLP based expression profiling of sequence obtained from samples of a 
35 synchronized tobacco BY2 cell line system, was used to identify genes that are upregulated 
during the cell cycle, an essential biological process needed for the viability and growth of the 
tobacco cell line system. 

14 
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A genome-wide expression analysis of cell cycle-modulated genes in the tobacco Bright 
Yellow-2 (BY2) cell line was performed. This unique cell line can be synchronized to high 
levels with different types of inhibitors of cell cycle progression (Nagata et a/., Int. Rev. Cytol., 
132 1 - 30, 1992; Planchais et a/., FEBS Lett., 476 78 -83, 2000). Because of the lack of 
5 extensive molecular resources such as genomic sequences, cDNA clones or expressed 
sequence tags (ESTs) for tobacco, a microarray-based approach cannot be used for a 
transcriptome analysis. Therefore, the cDNA-AFLP technology was used to identify and 
characterize cell cycle-modulated genes in BY2. cDNA-AFLP is a sensitive and reproducible 
fragment-based technology that has a number of advantages over other methods for 

10 genome-wide expression analysis (Breyne and Zabeau, Curr. Opin. Plant Biol., 4 136- 142, 
2001): it does not require prior sequence information, it allows identification of novel genes, 
and it provides quantitative expression profiles. After a detailed analysis, it was found that 
around 10% of the transcripts analyzed is periodically expressed. This comprehensive 
collection of plant cell cycle-modulated genes provides a basis for selecting and validating 

1 5 novel and unexpected agrochemical target genes 



Synchronization of BY2 cells and sampling of material. Tobacco BY2 (Nicotiana tabacum 
L. cv. Bright Yellow-2) cultured cell suspension were synchronized by blocking cells in early 
S-phase with aphidicolin as follows. Cultured cell suspension of Nicotiana tabacum L cv. Bright 

20 Yellow 2 were maintained as described (Nagata et a/., Int. Rev. Cytol., 132 1 - 30, 1992). For 
synchronization, a 7-day-oId stationary culture was diluted 10-fold in fresh medium 
supplemented with aphidicolin (Sigma-Aldrich, St. Louis, MO; 5 mg/l), a DNA-polymerase a 
inhibiting drug. After 24 h, cells were released from the block by several washings with fresh 
medium and resumed their cell cycle progression. After the drug had been washed, samples 

25 were taken every hour, starting from the release from the aphidicolin block (time 0) until 1 1 h 

later. The mitotic index was determined by counting the number of cells undergoing mitosis %J 
under fluorescence microscopy after the DNA had been stained with 5 mg/l 
4\6-diamidino-2-phenyfindole (Sigma-Aldrich). DNA content was measured by flow cytometry. 
This was done as follows A subsample was used to check cell cycle progression and synchrony 

30 levels. After the DNA had been stained with 5 mg/l 4\6-diamidino-2-phenylindole 
(Sigma-Aldrich), the mitotic index was determined under fluorescence microscopy by counting 
the number of cells undergoing mitosis. A mitotic peak of approximately 40% was obtained 8 h 
after washing. For flow cytometry, cells were first incubated in a buffered enzyme solution (2% 
cellulase and 0.1% pectolyase in 0.66 M sorbitol) for 20 min at 37°C. After the suspension had 

35 been washed and resuspended in Galbraith buffer (Galbraith et a/., Science, 220 1049 - 1051, 
1983), it was filtered through a 30-jjrn nylon mesh to purify the DAPI-stained nuclei. The 
fluorescence intensity was measured using a BRYTE HS flow cytometer (Bio-Rad, Hercules, 
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CA). Exit from S-phase was observed 4 h after aphidicolin release and the level of synchrony 
was shown to be sufficiently high throughout the time course. 

RNA extraction and cDNA synthesis. Total RNA was prepared by using LiCI precipitation 
5 (Sambrook et al., 1989) and poly(A + ) RNA was extracted from 500 ng of total RNA using 
Oligotex columns (Qiagen, Hilden, Germany) according to the manufacturer's instructions. 
Starting from 1 ug of poly(A*) RNA, first-strand cDNA was synthesized by reverse transcription 
with a biotinylated oligo-dT 25 primer (Genset, Paris, France) and Superscript II (Life 
Technologies, Gaithersburg, MD). Second-strand synthesis was done by strand displacement 
10 with Escherichia coli ligase (Life Technologies), DNA polymerase I (USB, Cleveland, OH) and 
RNAse-H (USB). 

cDNA-AFLP analysis. Five hundred ng of double-stranded cDNA was used for AFLP analysis 
as described (Vos et al., Nucl. Acids Res., 23 4407 - 4414, 1995; Bachem et al., Plant J., 9 745 

15 - 753, 1996) with modifications. The restriction enzymes used were BsfYI and Msel (Biolabs) 
and the digestion was done in two separate steps. After the first restriction digest with one of the 
enzymes, the 3' end fragments were collected on Dyna beads (Dynal, Oslo, Norway) by means 
of their biotinylated tail, while the other fragments were washed away. After digestion with the 
second enzyme, the released restriction fragments were collected and used as templates in the 

20 subsequent AFLP steps. The adapters used were: for BsfYI, 5'-CTCGTAGACTGCGTAGT-3' 
and 5'-G ATCACTACGCAGTCTAC-3' , and for Msel, 5'-GACGATGAGTCCTGAG-3' and 
5-TACTCAGGACTCAT-3'; the primers for BsfYI and Msel were 
5 , -GACTGCGTAGTGATC(T/C)N 1 . 2 -3* and 5'- GATGAGTCCTGAGTAANL2-3', respectively. For 
preamplifications, a Msel primer without selective nucleotides was combined with a BsfYI primer 

25 containing either a T or a C as 3' most nucleotide. PCR conditions were as described Vos ef a/., 
Nucl. Acids Res., 23 4407 - 4414, 1995). The obtained amplification mixtures were diluted 
600-fold and 5 ul was used for selective amplifications using a P^-labeled BsfYI primer and the 
Amplitaq-Gold polymerase (Roche Diagnostics, Brussels, Belgium). Amplification products were 
separated on 5% polyacrylamide gels using the Sequigel system (Biorad). Dried gels were 

30 exposed to Kodak Biomax films as well as scanned in a phospholmager (Amersham Pharmacia 
Biotech, Little Chalfont, UK). 

Quantitative measurements of the expression profiles and data analysis. Gel images 
were analyzed quantitatively with the AFLP-QuantarPro image analysis software (Keygene 
35 N.V., Wageningen, The Netherlands). This software was designed for accurate lane definition, 
fragment detection, and quantification of band intensities. All visible AFLP fragments were 
scored and individual band intensities were measured per lane. The obtained data were used to 
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determine the quantitative expression profile of each transcript. The raw data were corrected 
for differences in total lane intensities, after which each individual gene expression profile was 
variance-normalized . This was done as follows. 

The obtained raw data were first corrected for differences in total lane intensities which may 

5 occur due to loading errors or differences in the efficiency of PCR amplification with a given 
primer combination for one or more time points. The correction factors were calculated based on 
constant bands throughout the time course. For each primer combination, a minimum of 
10 invariable bands was selected and the intensity values were summed per lane. Each of the 
summed values was divided by the maximal summed value to give the correction factors. Finally, 

1 0 all raw values generated by QuantarPro were divided by these correction factors. 

Subsequently, each individual gene expression profile was variance-normalized by standard 
statistical approaches as used for microarray-derived data (Tavazoie et a/., Nature Genet., 22 
281 - 28.5, 1999). For each transcript, the mean expression value across the time course was 
subtracted from each individual data point after which the obtained value was divided by the 

15 standard deviation. A coefficient of variation (CV) was calculated by dividing the standard 
deviation by the mean. This CV was used to establish a cut-off value and all expression profiles 
with a CV less than 0.25 were considered as constitutive throughout the time course. 
The Cluster and TreeView software (Eisen et a/., PNAS, 95 14863 - 14868, 1998) was used 
for hierarchical, average linkage clustering. Quality-based clustering was done with a newly 

20 developed software program (De Smet et al., Bioinformatics 2002 May; 18(5): 735-46). This 
program is related to K-means clustering, except that the number of clusters does not need to 
be defined in advance and that the expression profiles that do not fit in any cluster are 
rejected. The minimal number of tags in a cluster and the required probability of genes 
belonging to a cluster were set to 10 and 0.95, respectively. With these parameters, 86% of all 

25 the tags were grouped in 21 distinct clusters. 

Characterization of AFLP fragments. Bands corresponding to differentially expressed 
transcripts were isolated from the gel and eluted DNA was reamplified under the same 
conditions as for selective amplification. Sequence information was obtained either by direct 

30 sequencing of the reamplified polymerase chain reaction product with the selective BstY\ 
primer or after cloning the fragments in pGEM-T easy (Promega, Madison, Wl) or sequencing 
of individual clones. The obtained sequences were compared against nucleotide and protein 
sequences present in the publicly available databases by BLAST sequence alignments 
(Altschul et al., Nucl. Acids Res., 25 3389 - 3402, 1997). When available, tag sequences were 

35 replaced with longer EST or isolated cDNA sequences to increase the chance of finding 
significant homology. Based on the homology, transcript tags were classified in functional 
groups as shown in Table 1 . 
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Experimental Results 

Identification and characterization of cell cycle-modulated genes 

Tobacco BY2 cells were synchronized by blocking cells in early S-phase with aphidicolin, an 
5 inhibitor of DNA polymerase a. After the inhibitor had been released, 12 time points with an 1-h 
interval were sampled, covering the cell cycle from S-phase until M-to-G1 transition. Flow 
cytometry and determination of the mitotic index showed that the majority of cells exit S-phase 
4 h after release from blocking and that the peak of mitosis is reached at 8 h. From each time 
point, extracted mRNA was subjected to cDNA-AFLP-based transcript profiling. 

10 Quantitative temporal accumulation patterns of approximately 10,000 transcript tags were 
determined and analyzed. In total, around 1 ,340 transcript tags were modulated significantly 
during the cell cycle. Hierarchical clustering of the expression profiles resulted in four large 
groups with the peak of expression in S-, early G2-, late G2-, or M-phase. Within each of these 
groups, several smaller clusters of genes with similar expression patterns could be 

15 distinguished. By quality-based clustering 21 different clusters were identified (see: 
http://www.plantgenetics/genomics/CCMgenes). In agreement with the hierarchical clustering, 
the four largest clusters (clusters 1 to 4 in Fig. 1) correspond to the S-, early G2-, late G2-, and 
M-phases and together contain 65% of all the tags. An additional cluster (cluster 5 in Fig. 1C), 
not clearly separated in the hierarchical clustering, includes the genes with peak expression in 

20 G1 -phase and contains another 5% of the tags. The remaining clusters are much smaller and 
most often (e.g., clusters 6, 9, 10, and 18) include genes with a narrow temporal expression 
pattern. In addition to these clusters, three small groups of genes displaying elevated 
expression during two cell cycle phases were distinguished also by quality-based clustering 
(Fig. 1 D). 

25 After the transcript tags had been sequenced, homology searches revealed that 36.5% of the 
tags were significantly homologous to genes of known functions, 13.1% of the tags matched a 
cDNA or genomic sequence without allocated function, whereas for 50.4% of the tags no 
homology with a known sequence was found. Genes of known function belong to diverse 
functional classes (Table 1) revealing that several biological processes are at least partially 

30 under temporal transcriptional control during the cell cycle in plants. In general, the observed 
transcript accumulation profiles and cell cycle specificity correlate well with the functional 
properties of the corresponding genes. It is interesting that the number of transcription factors 
with G2-phase specificity is high, which may be related with the induction of genes involved in 
M-phase-specific processes. The overrepresentation of RNA-processing genes in the M-phase 

35 might indicate that post-transcriptional regulation is involved in gene activity during mitosis. 
Because de novo transcription is severely reduced during mitosis (Gottesfeld et a/., Trends 
Bioch. Sci., 22 197 - 202, 1997). RNA-processing could provide an alternative regulatory 
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mechanism. Intriguingly, transcript tags with homology to a gene of unknown function are 
overrepresented in the M-phase as well (Table 1). The principal differences in cell cycle events 
between plants and other organisms occur during mitosis; therefore, the inventors believe that 
several of these transcripts correspond to still uncharacterised plant-specific genes triggering 
these events. Remarkably, several of the tags homologous to a publicly available sequence 
have no Arabidopsis homologue, indicating that, in addition to conserved genes, different plant 
species possess also unique sets of cell cycle-modulated genes. Although many of these tags 
may be too short to significantly match with an Arabidopsis sequence, analysis of longer cDNA 
clones corresponding to a subset of tags has revealed that approximately 25% of the 
sequences remain novel. 

In Tables 1 to 4 a selection of 785 sequence tags are shown. This selection was based on the 
criterion if the tags were full length or that showed homology with genes known to be involved 
in the cell cycle (group 2 SEQ ID NOs 22 to 118), or on the criterion that they show homology 
with genes of unknown function (group 3 SEQ ID NOs 119 to 283) or on the criterion that the 
sequences showed no homology with the sequences in that existing databases (group 4 SEQ 
ID NOs 284-785). A first group (SEQ ID Nos 1 to 21) represent a smaller selection of tags 
which are used in the target validation method described in the present invention, more 
particularly, that were used in example 2. 

The core cell cycle machinery 

Several tags coincide with genes belonging to the core cell cycle machinery and exhibiting 
distinct expression profiles. Transcript tags from five B1- or B2-type cyclins as well as from a 
D2-type cyclin show mitotic accumulation and exhibit a narrow temporal expression profile, 
confirming previous studies (Mironov et a/., Plant Cell, 11 509 - 521, 1999; Sorrell et a/., Plant 
Physiol., 119 343 - 351, 1999). Based on the transcription patterns, the six A-type cyclins fall 
into three groups that sequentially appear during the cell cycle, adding new data to earlier 
observations (Reichheld et a/., PNAS, 93 13819 - 13824, 1996). Two groups have quite a 
broad window of transcript accumulation; one group, homologous to A3-type cyclins, is 
expressed during S-phase and disappears during G2-phase and the other group, 
corresponding to A2-type cyclins comes up at mid S-phase and goes down during M-phase, 
except for one transcript that is specific for S-phase. The third group, containing an A1-type 
cyclin, has the same expression pattern as the B- and D2-type cyclins. Several tags derived 
from genes encoding the plant-specific B-type cycl in-dependent kinases (CDKs) were also 
identified. CDKB1 and CDKB2 peak at the G2-to-M transition, slightly before the mitotic cyclins 
as describe (Porceddu et a/., J. Biol. Chem., 276 36354 - 36360, 2001). In contrast to what 
has been observed in partially synchronized alfalfa cell cultures (Magyar et ai, Plant Cell, 9 
223 - 235, 1997), the transcript levels of the tags homologous to a C-type CDK accumulate 
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differentially during the cell cycle. The transcripts are present during late M-phase and early 
S-phase, suggesting that CDKC is active during the G1 -phase. 

In addition to these well-characterized cell cycle-regulatory genes, also several tags were 
identified herein derived from genes encoding transcription factors and protein kinases or 
5 phosphatases with a known or putative role in cell cycle control. One tag with a sharp peak of 
transcript accumulation 1 h before the B- and D-type cyclins corresponds to a 3R-MYB 
transcription factor. Recently, a 3R-MYB has been shown to activate B-type cyclins and other 
genes with a so-called M-phase-specific activator domain (Ito et a/., Plant Cell, 13 1891 - 
1905, 2001). Another tag peaking in M-phase is homologous to the CCR4 associated protein 
10 CAF. CAF forms a complex with CCR4 and DBF2, resulting in a transcriptional activator 
involved in the regulation of diverse processes including cell wall integrity, methionine 
biosynthesis and M-to-G1 transition (Liu et a/., EM BO J., 16 5289 - 5298, 1997). A majority of 
the tags with similarity to protein kinases and phosphatases show M-phase-specific 
accumulation (Table 1). Although the true identity and putative cell cycle related function 
15 remains unclear for the majority, one is highly homologous to a dual-specificity phosphatase. 
This type of phosphatases plays a crucial role in cell cycle control in yeast and animals 
(Coleman and Dunphy, Curr. Opin. Cell Biol., 6 877 - 882, 1994). Another M-phase-specific 
tag is homologous to prohibitin. In the mammalian cell cycle, prohibitin represses 
E2F-mediated transcription via interaction with retinoblastoma (Rb), thereby blocking cellular 
20 proliferation (Wang et al., Oncogene, 1 8 3501 - 3510, 1999). 

Protein degradation by the ubiquitin-proteasome pathway also plays an important role in the 
control of cell cycle progression at both G1-to-S transition and exit from mitosis. Although there 
is little evidence for cell cycle-modulated expression of the genes encoding the various 
components of the ubiquitin-proteasome complexes, some proteins accumulate in a cell 
25 cycle-dependent way (del Pozo and Estelle, Plant Mol. Biol., 44 123 - 128, 2000). 
Furthermore, several tags were isolated herein from genes encoding ubiquitin-conjugating 
enzyme (E3), ubiquitin-protein ligase (E2), and proteasome components with an 
M-phase-specific expression pattern. Another transcript tag that accumulates during late 
M-phase is similar to cathepsin B-like proteins, which are proteolytically active and degrade 
30 diverse nuclear proteins, including Rb (Fu et al., FEBS Lett., 421 89 - 93, 1998). 

Whereas all the core cell cycle regulatory genes have been identified that control the G2-to-M 
transition for which the expression is known to be cell cycle modulated, genes such as Rb and 
E2F, controlling G1-to-S transition were not found. These genes were probably missed 
because the G1-to-S transition was not included in the present analysis, what is supported by 
35 the finding that the early targets of E2F, such as polymerase a and ribonucleotide reductase, 
are already present at high levels at the beginning of the time course. 
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Genes involved in DNA replication and modification 

In agreement with the studies performed in yeast and human fibroblasts, transcripts encoding 
proteins involved in DNA replication and modification accumulated during S-phase and 
exhibited broad temporal expression profiles. Different replication factors, DNA polymerase a. 
5 and the histones H3 and H4 are already present at the onset of the time course, indicating that 
they are induced before the time point of the aphidicolin arrest. Interestingly, most of the 
histones H1, H2A, and H2B appear somewhat later than H3 and H4, what might reflect that 
they are deposited into the nucleosomes after H3 and H4 (Luger ef ai, Nature, 389 251- 260, 
1997; Tyler et ai, Nature, 402 555 - 560, 1999). The profile of the homologue of the 
10 anti-silencing function 1 (ASF1) protein is similar to that of the histones H3 and H4, in 
agreement with the fact that the three proteins are part of the replication-coupling assembly 
factor complex that mediates chromatin assembly (Tyler ef a/., Nature, 402 555 - 560, 1999). 
Genes encoding high-mobility group proteins reach the highest accumulation during late G2, 
consistent with the subsequent steps involved in the folding and structuring of the chromatin. 
15 Tags derived from genes encoding proteins involved in DNA modification, such as 
S-adenosyl-L-methionine (SAM) synthase and cytosine-5-methyl- transferase are found in the 
histone cluster. Tags from methionine synthase genes, which provide the precursor for SAM 
synthase, accumulate during M-phase, in contrast to yeast, where these genes are expressed 
during late S-phase (Spellman et ai, Mol. Cell ^iol., 9 3273 - 3297, 1998). 
20 Genes involved in chromatin remodelling and transcriptional activation or repression have 
been identified as well. One gene is a histone deacetylase with highest transcript accumulation 
during the G2-phase and another belongs to the SNF2 family of chromodomain proteins with 
an M-phase-specific expression pattern. Interestingly, one tag corresponds to a mammalian 
inhibitor of growth 1 (p33-ING1) protein. The human ING1 protein has DNA-binding activity 
25 and might be involved in chromatin-mediated transcriptional regulation (Cheung and Li, Exp. 
Cell Res., 268 1 - 6, 2001). This protein accumulates during S-phase (Garkavtsev and 
Riabowel, Mol. Cell Biol., 17 2014 - 2019, 1997), what is in agreement with the expression 
profile we observed. The yeast homologues of ING1 are components of the histone 
acetyltransferase complex and show similarity to the Rb-binding protein 2 (Loewith et ai, Mol. 
30 Cell Biol., 20 3807 - 3816, 2000). Another tag, homologous to the Arabidopsis MS13 protein, 
follows a similar expression profile. MSI-like proteins are involved in the regulation of histone 
acetylation and deacetylation and in chromatin formation (Ach ef ai, Plant Cell, 9 1595 - 1606, 
1997). 

The expression profiles of the different ribonucleotide reductase (RNR) genes are more 
35 complex. One gene is already expressed at high levels at the beginning of the time course and 
its expression is restricted to the S-phase as described (Chaboute ef ai, Plant Mol. Biol., 38 
797 - 806, 1998), whereas, in contrast, another one is highly expressed in S-phase and 
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reappears at lower levels during M-phase and a third one is M-phase-specific. This latter 
expression profile has also been described for a RNR gene from Xenopus where the encoded 
protein appears to be involved in microtubulin nucleation (Takada et a/., Mot. Cell Biol., 11 
4173-4187, 2000). 

5 Numerous other transcript tags with S-phase specificity were found in addition to the ones 
involved in DNA replication and modification. Most interestingly, one of these tags is 
homologous to a mammalian gene encoding a TRAF-interacting protein (TRIP), which is a 
component of the tumor necrosis factor (TNF) signalling complex, and promotes cell death 
when compiexed with TRAF (Lee et a/., J. Exp. Medicine, 185 1275 - 1285, 1997). Another 
10 S-phase-specific tag shows homology to the RING finger domain of inhibitor of apoptosis 
proteins, which are also involved in the TNF signalling pathway. 

Modulated expression of genes required for mitosis and cytokinesis 

Several paralogous genes that encode either a- or (3-tubulin were highly induced and 
15 accumulated prior to the mitotic index peak or during early M-phase. The inventors found that 
in BY2, tubulin genes are highly cell cycle modulated. This transcriptional regulation is in 
agreement with previous demonstrations of de novo transcription of a- and p-tubulin genes 
during different cellular processes (Stotz et a/., Plant Mol. Biol., 41 601 - 614, 1999). In the 
present analysis, no y-tubulin genes were found, confirming published data that the amount of 
20 y-tubulin is constant in dividing BY2 cells (Stoppin-Mellet ef a/., Plant Biol., 2 290 - 296, 2000). 
Most of the kinesins identified herein, fall in the same cluster as the tubulins peaking prior to 
mitosis. Interestingly, two tags have a distinct transcription pattern and appear in another gene 
cluster. Their window of transcript accumulation is very narrow and coincides with the peak of 
mitosis. Most interestingly, these tags correspond to the plant-specific 
25 phragmoplast-associated type of kinesin, PAKRP1 (Lee and Liu, Curr. Biol., 10 797 - 800, 
2000). A chromokinesin not yet described in plants was identified as well. This type of motor 
proteins use DNA as cargo and play a role in chromosome segregation and metaphase 
alignment (Wang et a/., J. Cell Biol., 128 761 - 768, 1995). 

Among the M-phase-specific kinases, two were unambiguously recognized herein as playing a 
30 role in cytokinesis. One is Aurora, a protein kinase with a key role in the control of 
chromosome segregation, centrosome separation, and cytokinesis in yeast and animals 
(Bischoff and Plowman, Trends Cell Biol., 9 454-459, 1999) but not described in plants yet. 
The other is NRK1, a mitogen-activated protein kinase kinase which is phosphorylated by 
NPK1, a kinase involved in regulating the outward redistribution of phragmoplast microtubules 
35 (Nishihama et a/., Genes Dev., 15 352 - 363, 2001). 
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Hormonal regulation and cell cycle-modulated gene expression 

A number of genes belonging to the class of auxin-induced genes were also differentially 
expressed. Cell cycle-modulated expression of auxin-induced genes has never been observed 
before although auxins together with cytokinins are the two major groups of plant hormones 
that affect cell division (Stals and Inze, Trends Plant Sci., 6 359 - 364, 2001). The genes as 
identified herein fall into two groups based on their transcript accumulation profiles (data not 
shown). The first group displays an early S-phase-specific expression pattern and consists of 
the parA, parB and parC genes. Induction of the par genes is most often observed in response 
to stress conditions (Abel & Theologis, Plant Phys. 111, 9 - 17. 1996). The fact that the 
transcripts rapidly disappear after release from the cell cycle-blocking agent might indicate a 
stress response rather than a cell cycle dependent auxin response. 

More interesting is the second group of genes with transcripts accumulating during early 
M-phase. This group includes the auxin response factor 1 (ARF1), an auxin transporter as well 
as different members of the early auxin response AUX/IAA gene family. ARF1 is a transcription 
factor that binds to a particular auxin response element (Ulmasov et a/., Science, 276 1865 - 
1868, 1997). Additional studies suggest that the activity of ARF1 is controlled by its 
dimerization with members of the AUX1/IAA family (Walker and Estelle, Curr. Opin. Plant boil., 
1 434 _ 439,1998). The similarity in temporal expression profiles the inventors observed 
supports these findings and suggests that these proteins mediate an auxin response 
necessary for cell cycle progression 

By using tobacco BY2 as model system together with cDNA-AFLP-based transcript profiling, it 
is described herein for the first time how a comprehensive inventory of plant cell 
cycle-modulated genes can be made. Although the obtained data confirm earlier results and 
observations, in addition, numerous novel findings were made. The obtained data are a very 
useful basis for selecting and validating agrochemical target genes. 

Example 2 

In this example it is described how plant genes are evaluated for assessment of their essential 
character in the biological process, thus how they are validated as good candidate targets for 
agrochemicals. 

The Tobacco Rattle Virus (TVR) is used to induce silencing of target genes . In case of an 
essential gene the simlencing will result in a lethal effect on the plant and therefore, the 
suystem allows to validate good candidates as targets for herbicides . 

The TRV based system is used in this example in combination with series of candidate genes, 
more particularly with the candidate targets as represented herein as group 1 sequences 
consisting of the SEQ ID NOs 1 to 21. The identification technique of the present invention 
(see example 1) allowed to identify new genes that are potential new herbicide targets, 
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because of their putative function in various key processes crucial for cell life, their expression 
at a certain developmental stage crucial for cell life, their role in metabolism and/or 
maintenance of cell living state. 

This example illustrates the validation of these candidate genes as novel targets for 
5 agrochemicals, via the technique of the virus-induced gene silencing (VIGS). 

Gene silencing mechanism 

The virus-induced gene silencing (VIGS) is a manifestation of an RNA-mediated defence 
mechanism that is related to post-transcriptional gene silencing (PTGS) in transgenic plants 

10 (Ratcliff et a/., Plant J., 25 237 - 245, 2001). The method uses a vector with an infectious 
cDNA of tobacco rattle virus (TRV) modified (see below) to facilitate insertion of target 
sequences and modified for efficient infection of plants (e. g. tobacco). The vector mediates 
VIGS of endogenous genes in the absence of specific virus-induced symptoms. 
The RNA-mediated defence is triggered by the virus vectors, and targets both the viral genome 

15 and the host gene corresponding to the insert. As a result, the symptoms in the infected plant 
are similar to loss-of-function mutants or reduced-expression mutants in the host gene. The 
presence of a negative growth phenotype suggests that the targeted gene is a potential 
herbicide target. 

The process of constructing a virus vector and monitoring symptoms on infected plants is 
20 completed within a few weeks, such that virus-induced gene silencing (VIGS) provides a 
simple, rapid means of assigning function to genes that have been sequenced but are 
otherwise uncharacterized. The determination of new herbicide target genes is performed in a 
few weeks including gene cloning, transformation steps and tobacco plant analyses. 
The TRV construct is shown to target host RNAs in the growing points of plants (Ratcliff et a/., 
25 Plant J., 25 237 - 245, 2001) such as meristems and actively dividing cells. 

It has been shown that this vector overcomes many of the problem features of PVX, TMV and 
TGMV. For example, the TRV vector induces very mild symptoms, infects large areas of 
adjacent cells and silences gene expression in growing points such as meristems and actively 
dividing cells. Infection of tobacco plants on the leaves with TRV based constructs will affect 
30 growth and development of upper parts of the infected leaves and allow screening for growth 
parameters. 

Construction of TRV vectors used in the validation process of the present invention 

TRV is a positive-strand RNA virus with a bipartite genome. Proteins encoded by RNA 1 are 
35 sufficient for replication and movement within the host plant, while proteins encoded by RNA 2 
allow virion formation and nematode-mediated transmission between plants (reviewed by 
MacFarlane, J. Gen. Virol., 80 2799 - 2807,1999). 
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The dowhregulation system is composed of separate cDNA clones of TRV RNA 1 and RNA 2 
under the control of cauliflower mosaic virus (CaMV) 35S promoters on the transferred T-DNA 
of plant binary transformation vectors. 

The TRV RNA 1 construct (pBINTRA6) contains a full-length infectious cDNA clone in which 
5 the RNA polymerase ORF is interrupted by intron 3 of the Arabidopsis Col-0 nitrate reductase 
NIA1 gene (Wilkinson and Crawford, Mol. Gen. Genet., 239 289 - 297, 1993), necessary to 
prevent expression of a TRV-encoded protein that is toxic to E. coli. This vector has been 
given the internal reference number p3209. 

The TRV RNA 2 construct (pTVOO), contains a multiple cloning site (MCS), leaving only the 5' 
10 and 3' untranslated regions and the viral coat protein (Ratcliff et a/., Plant Cell, 11 1207 - 
1215, 1999). This vector has the internal reference number p3930 and contains a Gateway™ 
cassette and the gene of interest to be tested. The genes as presented in SEQ ID NO 1 to 21 
are each cloned in this vector. 

cDNAs were amplified using Gateway compatible primers and the cDNAs were entered into 
15 Entry Clones by BP recombination reactions. Subsequently the entry clones comprising the 
gene according to any one of SEQ ID NO 1 to 21 were checked via Ban2 restiction digest. The 
genes of interest were then entered into destination vectors by LR recombination reactions and 
the destination vectors were checked via ECORV restriction digestions. These expression 
clones were electroporated into the Argobacterium strain GV3101 agro and the plasmid 
20 pBintra6 was electroporated into pMP90 agro. 

Inoculation 

To inoculate plants, Agrobacterium cultures carrying pBINTRA6 (strain C58C1RifR containing 
pMP90 plasmid) and pTVOO (strain GV3101 containing pMP90 plasmid) were grown and 

25 mixed and infiltrated to the leaves of Nicotians benthamiana as previously described (English 
et a/., Plant J., 12 597 - 603, 1997). Briefly, virus infection was achieved by Agrobacterium- 
mediated transient gene expression. Agrobacterium containing the TRV cloning vectors were 
grown overnight in L brith (+Tc+Km), Agrobacterium containing the helper plasmid was grown 
overnight in 10 ml YEB+Rif+Km. The culture was centrifuged and resuspended in 10 ml of 

30 10mM MgCI 2 , 1mM MES-pH5.6 and 100pM acetosyringone and kept at room temperature for 
2 h. Separate cultures containing pBINTRA6 and TRV cloning vectors were mixed in a ratio of 
1:10. The culture was then infiltrated to the underside of two leaves of three-weeks old plants 
using a 2 ml syringe without a needle. In two independent experiments 6 plants per 
agroabcterium clone were infected. In this way the cloned genes (SEQ ID NO 1-21) were 

35 transferred into the cells of the infiltrated region, and could be transcribed into the viral cDNAs 
in the leave cells. These transcripts then serve as an inoculum to initiate systemic infection of 
the plant. Consequently the VIGS system is activated, resulting in the downregulation of the 
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host cell gene, corresponding to the cloned gene of interest. All experiments involving virus- 
infected material was carried out in controlled growth chambers. N. benthamiana plants were 
germinated ad grown individually on universal potting ground in pots at 25°C during the day 
(16h) and 20°C during the night (8h). 

5 

The plants were phenotypically evaluated on a daily basis. Particular attention was given to 
visible leaf damage and growth inhibition. The effects of the suppression of gene activity using 
the VIGS system is measured by the phenotypic aspect of the plants, including leaf defects 
such as growth retardation, yellow or necrotic spots, early senescence, etc. The effects of the 
10 downregulation of genes identified by the methods of the invention are also measured on the 
flower structure and the flowering capacities of the transformed plants. 

The severity of the phenotype is linked to the level of suppression of the geneactivity and 
indicates the degree in which the gene is essential for the plant Therefor the phenotype is an 
indication of the degree in which the gene is a valid target for a herbicide. 

15 

Phenotypes of the infected plants. 

1. Co-suppression of the gene leads to loss of gene transcription and protein expression in the 
virus infected leaf and induces leaf growth modification, including leaf wrinkling, curling, wilting, 
leading to cell death and/or plant death. 

20 

2. Co-suppression of the geneleads to loss of gene transcription and protein expression in the 
virus infected leaf and induces leaf yellowing or senescence, or cell degth and necrosis, 
leading to plant death. 

25 3. Co-suppression of the gene leads to loss of gene transcription and protein expression in the 
virus infected leaf and induces any of the following phenotypic symptoms: chlorotic regions 
around infection, crisp or crunchy leaf texture around infection, numerous surface lumps on 
either leaf surface, abnormal trichomes, abnormal leaf size, reduced growth, reduced final 
size, altered vascular leaf system, altered water movement in leaf , leading to cell death and/or 

30 plant death. 

4. Co-suppression of the gene leads to loss of gene transcription and protein expression in the 
virus infected leaf and induces any of the following anatomical symptoms: clumps of modified 
cells on the surface of the leaf (either abaxial or adaxial), individual cells detached from the 
35 epidermis, swollen or modified trichome cells, modification of leaf tissue structure, cell size, cell 
number, tissue composition, parenchyme, epidermis, etc , leading to cell death and/or plant 
death. 
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5. co-suppression of gene X leads to loss of gene transcription and protein expression in the 
virus infected leaf and induces any of the following biochemical symptoms, enzyme activity 
and products, degradation of leaf components and effects in neighboring leaves, stem, 
5 vascular system, .degradation of cell wall structure, communication between cells, modification 
of cell-cell signaling leading to cell death and/or plant death. 

The genes identified by the present invention can be utilized to examine herbicide tolerance 
mechanisms in a variety of plants cells, including gymnosperms, monocots and dicots. It is 
particularly useful in crop plant cells such as rice, corn, wheat, barley, rye, sugar beet, etc 

10 

Example 3 

Significant phenotypic alterations could be observed in plants infiltrated with Agrobacterium 
containing pBINTRA6 + Bstt44-4-340 (SEQ ID NO 18, acetolactate synthetase) and 
pBINTRA6 + Bstt2-42-520 (or T4-32-7) (SEQ ID NO 21, prohibitin) and pBINTRA6 + Bstt23-4- Q 
15 230 (SEQ ID NO 11, B-type CDK). 

At 10days post-infiltration the first symptoms were visible. The symptoms were persistent until 
the end of the experiment and could be observed in at least 5 out of the 6 infiltrated plants. 

The phenotypes of the plants transformed with acetolactate synthase are further described. 

20 In two separate replicated experiments, specific phenotypes on each plant infected with the 
acetolactate synthetase downregulation construct were observed (Figure 2). Winkling and 
wrapping of the leaves as well as some chlorotic spots were observed. Thus acetolactate 
downregulation provoked a general growth arrest accompanied with chlorotic and necrotic 
areas. These observations were in line with previous reports, wherein acetolactate synthetase 

25 is described as a useful herbicide target. 

O 

The phenotypes of the plants transformed with prohibitin are further described. 
In two separate replicated experiments, specific phenotypes on each plant infected with the 
prohibitin downregulation construct were observed (Figure 2). These plants showed strong 
30 wrinkling of the leaves about 20 days after infection, corresponding to the expected occurrence 
of silencing events. Thus the downregulation of probibitin provokes a severe leaf distortion and 
general growth arrest. 

The phenotype of the plants inoculated with a B-type CDK downregulation construct are shown 
35 in Figure 3. A late (from 30 days after inoculation) but strong negative effect on the plant 
growth was observed. The plants started to grow much slower and lost their apical dominance, 
resulting in the increased appearance of lateral branches. 
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Table 1. Functional classification of transcript tags 





Function 


Tags 


S 


G2 


M 


G1 


5 






27.7% 


15.8% 


52.9% 


3.6% 




Cell cycle control 


30 


5/8 (0.078) 


8/5 (0.068) 


14/16 (0.114) 


3/1 




Cell wall 


35 


6/10 (0.047) 


4/6(0.136) 


25/18 (7/1 e- 3 ) 


0/1 




Cytoskeleton 


43 


1/12(1. 2e" s ) 


4/7 (0.090) 


38/22 (2.1 e" 7 ) 


0/2 


10 


Hormone response 


13 


6/4(0.113) 


1/2 (0.277) 


6/7 (0.185) 


0/0 




Kinases/phosphatases 1 


27 


4/8 (0.039) 


1/4 (0.059) 


19/14 (0.025) 


3/1 




Protein synthesis 


50 


15/14(0.116) 


5/8 (0.087) 


29/26 (0.079) 


1/2 




Proteolysis 


21 


2/6 (0.026) 


1/3(0.144) 


17/11 (0.039) 


1/1 




Replication and modification 


74 


57/20 (4.2e' 19 ) 


8/12 (1.0e -5 ) 


8/39 (I.Oe* 18 ) 


1/3 


15 


RNA processing 


20 


1/6 (6.8e-3) 


1/3 (0.137) 


18/11 (8.1 e" 4 ) 


0/0 




Signal transduction 


10 


1/3(0.121) 


3/2 (0.201) 


6/5 (0.205) 


0/0 




Stress response 


20 


6/6(0.192) 


2/3 (0.229) 


10/10(0.159) 


2/1 




Transcription factors 


27 


4/8 (0.039) 


10/4 (Z.Oe- 3 ) 


12/14(0.112) 


1/1 




Transport and secretion 2 


31 


5/9 (0.047) 


2/5 (0.076) 


21/16 (0.031) 


3/1 


20 


Unknown 


175 


37/48 (0.015) 


19/28 (0.014) 


112/93 (8.3e"*) 


7/6 



The total number of tags and the observed/expected number of tags within the different cell 
cycle phases for each functional group is given together with the probability values between 
parentheses as calculated based on the binomial distribution function, except for the G1 -phase 
25 because the values were too small. A significant enrichment (P<e" 3 ) of tags of a functional 
group within a particular cell cycle phase is indicated in bold. 

1 Only kinases and phosphatases with unknown biological function. 

2 Except small GTP-binding proteins, which are classified under signal transduction. 



30 Table 2: overview o 



group 1 of sequences used for validation of candidate target gene s 



SEQID NO 


CDS NO 


Tag Name 


Function 


Fase 


1 


2216 


18R1850 C4-32-33 1E2 


catalase 


?? 


2 


2217 


Bstt2-31-215 


phytoene desasturase 


I?? 


3 


2218 


Bstc13-1-145 


L-ascorbate peroxidase 


M-G1 J 


4 


£21 9 


Bstc21 -4-280 


GTP-bindingprotein 


M 


5 


2220 


Bstc33-2-310 


vacuolarsortinqreceptor 


M 


6 


2221 


Bstc4-34-170 


probable cinnamyl alcohol dehydrogenase 


G1/S-S; M-G1 


7 


2222 


Bstt34-3-470 


kinesin 


M 


8 


2223 


Bsttl 2-3-410 


B-typeCDK 


M 


9 


2224 


Bsttl 4-3-458 


squalene mono-oxygenase I 


G1/S-S 


10 


2225 


Bstt12-1-230 


kinesin-iikeprotein 


M 


11 


2226 


Bstt23-4-230 


B-typeCDK 


M 


12 


2227 


Bstt2-42-225 


B-typeCDK 


M 


13 


2228 


Bstt31 -4-208 


arabinoqalactan protein precursor 


G2/M-M 


14 


2229 


Bstt 3-41-205 


arabinogalactan protein precursor 


G2/M-M ] 
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17 


2232 


Bstt4 1-2-400 


endo-beta-1 ,4glucanase 




18 


2233 


Bstt44-4-340 


acetolactate synthase 


G2/S-G2-M-G1 


19 


2234 


G1 7-2-1 3 G1 7-2-1 3 


WRKY transcription factor 




20 


2235 


mapk9-ntf6.seq 


mapkinase phragmoplast associated NTF6 ' 


»? 


21 


2236 


Bstt2-42-520 


prohibiten 


?? 



Table 3: overview of group 2 sequences of full-length sequences that are cell cycle 
modulated and of which some are involved in the cell cycle process 



SEQ 
ID 
NO 


CDS 
NO 


Gene name 


22 


061 3 


Protein kinase mRNA, complete , N. tabacum, 2073 bp 


23 


0614 


BY2 AA041K03 nrobable DNA-bindinn nrotein GBP16 - rice T02069 N tabacum 834 bD 


24 


0615 


BY2 AA042C09 Drobable nurlear DNA-bindinn nrotein G2d nmoortedl in ArabidoDSis T51 151 Ni 
tabacum, 1185bp 


25 


0616 


BY2-AA044J17 transcription regulator-like in Arabidopsis AB025604, N. tabacum, 1893bp 


26 


0617 


BY2 AA044J23 ATP-dependent RNA helicase CA3 of the DEAD/DEAH box family; Dbp3p; BY2- 
AA044J23P19G01 RNA helicase RH5 in Arabidopsis T51739 N. tabacum, 1593bp 


27 


0618 


BY2-AA046C15 protein phosphatase 2C-like in Arabidopsis BAB08417 AB025622, N. tabacum, 732bp 


28 


0619 


BY2-AA047G13 14-3-3-like protein C P93343, N. tabacum, 70bp 


29 


0620 


BY2-AA054L09 protein kinase tousled in Arabidopsis A49318 N. tabacum, 2037bp 


30 


0621 


BY2-AA066H11P19H05 phosphoprotein phosphatase 2A regulatory chain T03684 N. tabacum, 1764 bp 


31 


0622 


BY2-AA069L10 transrriDtion fartor-like nrotein in Arabidoosis BAB09482 AB012246 N tabacum 8^1 bn 


32 


0623 


RY2-AA07?K0fi SFT nrotein nhn<snatase 9 A inhibitor in Arabidoosis AAG52377 1 AC0117R^ W 
tabacum 


33 


0624 


RY?-AAn7*^MP1 QRD7 nhn«5nhnnrntf»in nho^nhataQP ?A rf^mtlatnrv rhain T0^fift4 N taharum 17ft4hn 


34 


0625 


RYP-AArYZ^iHI 9 Putative* nhr»«;nata«iP» PA inhihifnr in Arahidnnsic: ACOHfinQ Q Af^mififlQ M faharum 

LJ 1 / W f On 1 i_ 1 UlCHl VC \J\ iL/o^JcJlaoC II II IIUIll/l II 1 AAI UUIUUjJolo r\UU 1 1 Uv/ J J r\V»/U 1 1 UUC, 1 N . Id UcJUUI 1 1, 

783bp 


35 


0626 


BY2-AA076O02P19B08 hypothetical protein kinase in Arabidopsis T47727, N. tabacum, 2514 bp 


36 


0627 


BY2-AA079J13 putative casein kinase I in Arabidopsis AAG51841.1 AC010926 4 , N. tabacum, 1401bp 


37 


0628 


BY2-AA080G14 porin I 36K in potato S46959, N. tabacum, 393bp 


38 


0629 


BY2- AA081P13p21E02 separation anxiety protein-like in Arabidopsis CAB96669.1 AL360314, N. 
tabacum, 492bp 


39 


0630 


Complementary copy of 0630, N. tabacum, 975bp 


40 


0631 


BY2-AA085N17p21H04 14-3-3-like protein in potato 16R P93784 N. tabacum 768bp 


41 


0632 


BY2-AA087C16d21G03 AP2 domain transcriotion factor homoloa in Dotato T07784 N tabacum 891bo 


42 


0633 


BY2-AA088B13 nutative RING zinc finner nrotein in Arabidoosis CAB80936 1 AL161491 N tabacum 
1248bp 


43 


0634 


BY2- AAOQ^MOR nrotpin kinase hnmnlnn in Arahirionsis T02181 N tabacum8f58 


44 


0635 


BY2-AA096M07 oeotidvl-Droivl ris-trans isomera<;e-like oratein BAB10691 1 AB015468 N tabacum 
450bp 


45 


0636 


BY2-AA096M12 zincfmaer nrotein-like in Arabidoosis BAB09106 1 AB017069 N tabacum 1518bo 


46 


0637 


BY2-AA096M22 cell division-like protein in Arabidopsis T45963 N. tabacum 687bp 


47 


0638 1 


RY2-AA0QRB08d21 D1 1 similaritv tn OAC5 nrotein in Arahirionqis BAA97063 1 AP000370 N tabacum 
1 146bp 


48 


0638 2 


lcLAA091G16p21F05 N. tabacum 891bp 


4Q 




RY9-AAinQM1^ f^AMIV/11 nrntpin like* in ArahiHnnQic RARnfl4?fl 1 ARni7DR7 M tahanim ftftfthn rMYHI^ 

FAMILY, proliferation associated 


50 


0640 


Complementary copy of 0640 N. tabacum, 891 bp 


51 


0641 


BY2-AA1 14N16 unknown protein in Arabidopsis BAB03019.1 AP001297; candidate tumor suppressor 
p33 ING1 homolog in Homo sapiens N. tabacum 720bp 


52 


0642 


BY2-AA115P21p22D02 NAC2 Arabidopsis AAF09254.1 AF201456 1N. tabacum 699bp 


53 


0643 


BY2-AA119N11p22G04 serine/threonine-specific protein kinase-like protein BAB09338.1 AB016879 N. 
tabacum 1293bp 


54 


0662 


BY2-AA041E04 >pir||T06678 hypothetical protein T17F15.80 - Arabidopsis thaliana 


55 


0663 


BY2-AA043A01 >gb|AAD24540.1 |AF1 13545_1 (AF113545) vacuole-associated annexin VCaB42 
[Nicotiana tabacum] 


56 


0664 


BY2-AA044C02 >dbj|BAA02028.1| (D11470) chloroplast elongation factor TuB(EF-TuB) [Nicotiana 
tabacum] 


57 


0665 


BY2-AA044L14.dbj|BAA97319.1| (AB020754) gene id:MYN8.3~pir||T02891~similar to unknown protein 


58 


0666 


BY2-AA045P04p01G10 sp|Q43681|NLTP VIGUN PROBABLE NONSPECIFIC LIPID-TRANSFER 
PROTEIN AKCS9 


59 


0667 


BY2-AA046C08p19E02 dbj|BAB30364.1 1 (AK016659) putative fMus musculus] 
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60 ( 


3668 


3Y2-AA046E06 pir||T5055o stamina pistiHoidia protein otp limporteai - garaen pea 


61 


3669 


3Y2-AA046G14 dbj|DAD2bUo^.n | (ArMJuyi i f ) putaiive [ivius muscuiusj 


62 ( 


3670 


BY2-AA046H23 emb|CAA98 172.11 (Z73944) RAB8A [Lotus japonicus] 


63 i 


3671 


BY2AA048A05 gb|AAD1 5504.1 ) (AC006439) putativeAAA-type ATPase [Arabidopsis thaliana] 


64 ( 


3672 


BY2-AA049K03 db||BAB24909.1| (AK007240) putative [Mus muscuiusj 


65 ( 


3673 


BY2-AA051A10 dbj|BAB02543.1| (AP000417) mitotic checkpoint protein [Arabidopsis thaliana] 


66 


3674 


BY2-AA051L22p19H03 gb|AAD48948.1|AF!47262_1 1 (AF147262) contains similarity to Pfam family 
PF00400 -WD domain 


67 I 


3675 


BY2-AA052E10 >gb|AAF52905.1| (AE003628) CG4968 gene product [Drosophila melanogasterl 


68 


3676 


BY2-AA052F14 >qb|AAF79819.1IAC007396 20 (AC007396) T4012.22 fArabidopsis thaliana] 


69 


3677 


BY2-AA052G16p19D04 >dbj|BAB09843.1| (AB005246) gene__id:MUP24.12~unknown protein 
[Arabidopsis thaliana] 


70 


3678 


BY2-AA052N17 >qb|AAG42914.1 IAF327533 1 (AF327533) unknown protein [Arabidopsis thaliana] 


71 


3679 1 


BY2-AA053C11.1 >dbj|BAB22857.1| (AK003561) putative [Mus musculus] 


72 


3679 


2 BY2-AA053C11.2 >gb|AAC62883.1 1 (AC005397) hypothetical protein [Arabidopsis thaliana] 


73 


3680 


3Y2-AA062A09 >gb|AAF01061.1|AF189284_1 (AF1 89284) nucleolar G-protein NOG1 [Trypanosoma 
brucei] 


74 


3681 


BY2-AA062G03 >pir]|T02135 hypothetical protein F8K4.10 - Arabidopsis thaliana 


75 


3682 


BY2-AA065E08 >pir||T00795 hypothetical protein F24L7.13 - Arabidopsis thaliana 


76 


3683 


BY2-AA072K18 >emb|CAB40381.1| (AJ010819) GrpE protein fArabidopsis thaliana] 


77 


0684 


BY2-AA075K12 >gb|AAD31331 .1 |AC007354 4 (AC007354) T16B5.4 [Arabidopsis thaliana] 


78 


0685 


BY2-AA076N08 >dbj|BAA94770.1 1 (AP001859) ESTs AU082761 (S5084) D42006 


79 


0686 


BY2-AA080D01 >gb|AAF80646.1|AC012190_2 (AC012190) Contains similarity to F28016.19 a putative 
translation initiation protein 


80 


0687 


BY2-AA081P14 >gb|AAD32777.1)AC007661 14 (AC007661) unknown protein [Arabidopsis thaliana 


81 


0688 


BY2-AA082H04p21F02 >dbj|BAB10171.1| (AB016880) geneJd:MTG10.12-pir||T05795-strong similarity 
to unknown 


82 


0689 


BY2-AA082H06p21G04 >pir|jT09039 hypothetical protein F26K10.110 - Arabidopsis thaliana 


83 


0690 


BY2-AA082M07p21B05 >dbj|BAB01783.1| (AB022215) gene_id:MCB17.19~unknown protein 
[Arabidopsis thaliana] 


84 


0691 


BY2-AA083B24p21C04 >dbj|BAB08247.1| (AB006698) gene_Jd:MCL19.6~unknown protein [Arabidopsis 
thanliana) 


85 


0692 


BY2-AA083C05p21D02 >gb|AAH02924.1|AAH02924 (BC002924) Unknown (protein for 
livlAGE:3956179) [Homo sapiens] 


86 


0693 


BY2-AA085D08p21C05 >pir||T47624 hypothetical protein T5N23.10 - Arabidopsis thaliana 


87 


0694 


BY2-AA085F09p21H01 >qb|AAF79503.1|AC002328 11 (AC002328) F20N2. 15 [Arabidopsis thaliana] 


88 


0695 


BY2-AA085M15p21D04 >gb|AAF97305.1|AC007843_8 (AC007843) Unknown protein [Arabidopsis 
thaliana] 


89 


0696 


BY2-AA088K23p21G05 >gb|AAG52001.1|AC012563__1 1 (AC012563) unknown protein; 64612-65506 
[Arabidopsis thaliana] 


90 


0697 


BY2-AA088L24p21A07 >gb|AAD55292.1|AC008263_23 (AC008263) Contains PF|00249 Myb-like DNA- 
binding domain. 


91 


0698 


BY2-AA089F12p21H05 >gb|AAD55274.1|AC008263_5 (AC008263) btrong simiianty to gD|U^iouo 
calcium-dependent protein kinase 


92 


0699 


BY2-AA089M17 >pir||T02186 hypothetical protein F14M4.16 - Arabidopsis thaliana 


93 


0700 


BY2-AA090J23p21G08 >pir||T48545 hypothetical protein F1 4F1 8.30 - Arabidopsis thaliana 


94 


0701 


BY2-AA092F12p21H06 >emb|CAB46854.1 1 (AJ388555) hypothetical protein fCanis familiaris] 


95 


0702 


BY2-AA092L20p21 E07 >gb|AAD10646.1| (AC005223) 45643 [Arabidopsis thaliana] 


96 


0703 


BY2-AA093J23p21C11 >gb|AAG51461.1|AC069160__7 (AC069160) unknown protein [Arabidopsis 
thaliana] 


97 


0704 


BY2-AA093L1 8p21 D09 >emb|CAC1 5504.11 (AJ297917) Bz-type cyctin aepenoeni Kinase [Lycopersicon 


98 


0705 


BY2-AA093M19 >qb|AAG1 2535. 1|AC01 5446 16 (AC015446) Unknown protein [Arabidopsis thaliana] 


99 


0706 


BY2-AA094B12p21F10 >dbj|BAB02118.1| (AP000381) contains similarity to unknown 


100 


0707 1 


BY2-AA096G05p21A11 dbi|BAB02118.1I (AP000381) contains similarity to unknown 


101 


0707 2 


Id AA094B12p21F10 


102 


0708 


BY2-AA097G22p21D10 >gb|AAG60065.1|AF337913_1 (AF337913) unknown protein [Arabidopsis 
thaliana . 


103 


0709 


BY2-AA099F04 gb|AAG52457.1 |AC01 0852^1 4 (AC010852) hypothetical protein; 12785-11538 
[Arabidopsis thaliana] 


104 


0710 


BY2-AA099N08p21H09 qb|AAK1441 1.1IAC087851 3 (AC087851) unknown protein [Oryza satival 


105 


0711 


Id AA100B09 reflNP 009820.11 Ybr261cp [Saccharomyces cerevistae] 


106 


0712 


r>wo a AnnnKir\o —-.amo nmD/i q "ii rkarnwimm^i f«^»rt-»ckt»%/io+£iri nrrvioin* Mnt l^pkppninQ nene 33kD THomo 
BY^-AAT u&riuj. reTiNK uu^o^fo. ij peroxisomal Tarnesyiaiea proiein, nuuawccpiny y c,,c i 1 ^ 

sapiens • . 


107 


0713 


BY2-AA114E09p22F02 pir||T51434 hypothetical protein F2G14 10 - Arabidopsis thaliana 


108 


0714 


BY2-AA115B14p22C02 dbj|BAB08888.1| (AB012243) geneJd:MIJ24.6~ref|NP - .01 3897.1 -similar to 
unknown protein 



30 



BNSDOCID: <WO 03085115A2_I_> 



WO 03/085115 PCT/EPOJ/03703 



109 


0715 


BY2-AA115F08p22CO4 >gb|BY2-AAH03900.1|AAH03900 (BC003900) Similar to hypothetical protein 
^ft4Hft R riV/lit^ muscuiusl 


110 


0716 


BY2-AA115L12p22G01 >gb|AAF43925.1|AC012188_2 (AC012188) Contains similarity to PIT1 from 

AraKiHrin^ic ihalian?} 


i i i 


H71 7 


ry? AA11RI 9^n?s>Fm >dbHBAB01460 11 (AP000731) aene id :MCB 17.21 -unknown protein 
TArabidopsis thaliana] 


112 


0718 


BY2-AA117B12p21G12 >sp|O23708]PSA2_ARATH PROTEASOME SUBUNIT ALPHA TYPE 2 (20S 
PROTEASOME ALPHA SUBUNIT B) 


113 


0719 


[BY2-AA1 1 7E08p22A03 >pir||F81195 conserved hypothetical protein NMB0465 [imported] - Neisseria 


114 


0720 


BY2-AA117O08p22E03 >dbj|BAB01 753.1| (AP000603) qb|BY2-AAD10646.1~g;eneJd:MRP15.12 


115 


0721 


BY2-AA1 18D23p22E02 >emb|CAB894 90.1| (AJ277062) CRK1 protein fBeta vuigans], cdc2 like kinase 


116 


0722 


BY2-AA119D12p22H04 >dbj|BAB01 163.1 1 (AP000410) gene_id:K10D20.9~un known protein 
[Arabidopsis thaliana] 


117 


0723 


BY2-AA120G12 >qb|BY2-AAB63649.1| (AC001645) hypothetical protein TArabidopsis thaliana] 


118 


0724 


BY2-AA120G19p22D05 >gb|BY2-AAF69547.1|AC008007__22 (AC008007) F12M16.18 [Arabidopsis 
thaliana] 



Table 4: overview of group 3 sequences that show homology with proteins of unknown 



SEQ * 

ID 

NO 


rag name and 


function 


r ase 




SEQ " 
D 

MO 


rag name and 


function 


Fase 


119 


3stc1 -11-320 




VI-G1 




160 I 


3stc31-3-400 


jnknown 


oxd/M-M-ol 


120 


3stc1 -12-255 




32/M-M-G1 




161 


3stc32-1-122 


jnknown 


VI-G1 


121 


3stc1 -12-275 




32/M-M-G1 




162 


3stc3-21-125 




C31/0-S, 
oZ/M-ivi-ol 


122 


3stc1-13-143 


jnknown protein 


327M-M-G1 - 


123 


3stc1-13-160 


jnknownprotein 


32/M-M-G1 




163 


Bstc32-2-150 


— . ; 

putativeprotein 


bl/o-o, 
G2/M-M-G1 


124 


DStcn-o- iyu 




vl-o I 


125 


Bstd 1-3-21 5 


putativeprotein 


32/M-M-G1 




164 


Bstc32-4-193 






126 


Bstd 1-3-230 




G1/S; M-G1 


165 


3stc32-4-o7U 




VI-G1 


127 


Bstd 1-3-300 


unknown 


M-G1 


128 


Bstd 3-4-168 


hypotheticalprotein 


S-G2 


166 


Bstc3-31-350 


putativeprotein 


G1/S-S-G2/S 


129 


Bstd 3-4-290 


hypotheticalprotein 


M-G1 


1o7 




lypoineiicaiproiein 


G2/M-M-G1 


130 


3stc1 4-205 




G2/S-G2 


131 


Bstd -43-1 07 




G2/S-G2 


168 


Bstc3-33-350 




G1/S-S 


132 


Bstc14-3-165 


unknown 


M-G1 




□SICoo-ooU 


JULdll Vcpi ULCM 1 


H7/M-M-G1 

waj ivrivi vj i 


133 


Bstd -43-250 


unknown 


G2/M-M-G1 


170 


Bstc33-4-270 


unknown 


G2/M-M 


134 


Bstd-43-310 


hypotheticalprotein 


G2/M-M 


171 


Bstc3-41-270 


unknown 


M-G1 


135 


Bstc21 -2-270 


hypotheticalprotein 


G2/M-M-G1 


172 


Bstc3-41-300 




G2/M-M-G1 


136 


Bstc2-21-182 


unknown 


M-G1 


173 


Bstc3-41-360 




G2/M-M-G1 


137 


Bstc22-1-275 


unknownprotein 


G2-M-G1 


174 


Bstc3-42-175 




M-G1 


1 OO 


D e i-o on -inn 
D S lCZ-Z^- \ uu 


un Known 




175 


Bstc3-43-135 




G1 


139 


Bstc2-22-155 




G2-M 


176 


Bstc3-43-180 




M-G1 


140 


Bstc2-22-240 


hypotheticalprotein 


M 


177 


Bstc3-43-193 


unknown 


G1/S-S; 
G2m0-M-G1 


141 


Bstc22-2-270 




G1/S; M-G1 


142 


Bstc2-23-135 




G2/S-G2-M 


178 


Bstc3-43-287 




G1/S-S 


143 


Bstc2-23-220 


unknown 


G2-M-G1 


179 


Bstc3-44-145 




M-G1 


144 


Bstc22-4-215 


hypotheticalprotein 


G2/M-M 


180 


Bstc3-44-375 


putativeprotein 


M-G1 


145 


Bstc2-31-280 




G2/M-M-G1 


181 


Bstc4-1 1-120 


hypotheticalprotein 


G2/M-M-G1 


146 


Bstc23-2-240 


unknown 


M 


182 


Bstc4-1 1-320 


unknown 


M-G1 


147 


Bstc23-2-330 


putativeprotein 


M 


183 


Bstc42-3-115 


unknown 


M-G1 

G2/M-M-G1 


148 


Bstc23-2-370 




G1/S-S; 
G2/M-M-G1 


184 
185 


Bstc42-3-125 
Bstc4-23-210 


putativeprotein 


M-G1 


149 


Bstc2-32-400 




G1/S-S; 
G2/M-M-G1 


186 


Bstc42-4-225 


unknown 


G1/S-S-G2 


187 


Bstc4-32-115 


unknownprotein 


G1/S-S; 
G2/M-M-G1 


150 


Bstc23-3-270 




G1/S-S; M-G1 


151 


Bstc2-33-280 


unknownprotein 


G1/S-S;M-G1 


188 
189 


Bstc4-32-185 
Bstc4-32-190 


unknown 
unknown 


G1/S-S 
G2/M-M 


152 
153 


Bstc2-34-120 
Bstc23-4-300 


unknown 
unknown 


G2/M-M-G1 
M 




190 


Bstc4-32-270 


unknown 


G2/S-G2-M 


154 
155 
156 
157 
158 
159 


Bstc2-41-165 

Bstc2-42-100 

Bstc2-43-210 

Bstc31-185 

Bstc3-12-145 

Bstc3-1 2-290 


unknown 

unknown 
unknown 
unknown 


G1/S-S 
G1/S-S 
M-G1 

G2/M-M-G1 
S-G2 

G2/M-M-G1 




191 

192 
193 
194 
195 


Bstc4-32-410 

Bstc4-34-250 
Bstc4-41-230 
Bstc4-43-113 
Bstc44-3-125 


putativeprotein 
unknown 


G1/S-S-G2- 
G2/M 

G2/M-M-G1 
G2/M-M-G1 
M-G1 
G2/M-M 



WO 03/085115 



PCT/EP03/03703 



196 


Bstt1-12-340 


unknown 


G2/M-M 


197 


Bsttl 2-2-225 




G1/S-S-G2 


198 


Bstt1-22-330 


unknown 


G2/M-M-G1 


199 


Bsttl 2-2-420 


unknownprotein 


G2/M-M-G1 


200 


Bsttl 2-2-540 


hypotheticalprotein 


G2/M-M-G1 


201 


Bsttl -23-1 55 




M-G1 


202 


Bstt12-3-215 


hypotheticalprotein 


G2/M-M-G1 


203 


Bsttl 2-3-280 


unknown 


G1/S-S-G2 


204 


Bsttl 2-3-310 


hypotheticalprotein 


G1/S-S 


205 


Bsttl 2-3-350 




G1/S-S-G2- 
G2/M 


206 


Bsttl -24-205 




G2/M-M-G1 


207 


Bsttl -24-220 




G1/S-S-G2 


208 


Bsttl -31 -170 


hypotheticalprotein 


G2/M-M-G1 


209 


Bsttl -31 -21 5 


unknown 


G2/M-M-G1 


210 


Bsttl 3-210 


unknown 


G2/M-M-G1 


211 


Bstt14-4-310 


unknownprotein 


G2/M-M-G1 


212 


Bstt2-11-165 


unknown 


G2/M-M-G1 


213 


Bstt2-12-190 




G1/S-S-G2 


214 


3stt21-4-150 


hypotheticalprotein 


G1/S-S-G2/S 


215 


Bstt2 1-4-250 




G1/S-S; 
G2/M-G1 S 


216 


Bstt21 -4-470 




G2/M-M-G1 i 


217 


Bstt22-1-170 


unknown 


S-G2 


218 


Bstt2-21-190 


unknown 


G2/M-M 


219 


Bstt22-2-190 


unknown 


G2/M-M-G1 


220 


Bstt22-2-290 


hypotheticalprotein 


G2/M-M-G1 


221 


Bstt22-3-225 




M 


222 


Bstt22-3-275 


unknown 


G2/M-M 


223 


Bstt22-3-315 


TomatoEST 


G2/M-M-G1 


224 


Bstt22-3-370 


unknown 


G2/M-M-G1 


225 


Bstt22-3-390 


putativeprotein 


G2/M-M-G1 


226 


Bstt22-3-480 




G2/M-M-G1 


227 


Bstt23-1-140 




S-G2-G2/M 


228 


Bstt23-120 


unknownprotein 


G2/M-M-G1 


229 


Bstt23-1-200 




S-G2-M 


230 


Bstt2-31-300 


unknown 


S 


231 


Bstt2-32-220 




M 


232 


Bstt2-32-400 


hypotheticalprotein 


G2/M-M-G1 


233 


Bstt23-3-350 


unknown 


G2-M 


234 


Bstt23-370 


unknown 


G2/M-M-G1 




DStt24-1 -OZU 






236 


Bstt24-2-310 




G27M-M-G1 


237 


Bstt2-43-210 


unknown 


G2-M 


238 


Bstt2-43-240 




S-G2/S 


239 


Bstt31-1-100 


hypotheticalprotein 


G1/S-S-G2 



240 


Bstt3-1 1-205 




G1/S-S-G2 


241 


Bstt31 -1-250 


: : 

nypotheticafprotein 


G2/M-M-G1 


242 


Bstt31 -1-430 


lypotheticalprotein 


G2/M-M-G1 


243 


Bstt3-1 2-360 


unknownprotein 


G2/M-M 


244 


Bstt31 -3-380 




G1/S-S 


245 


Bstt31 -4-420 


hypotheticalprotein 


G2/M-M-G1 


246 


Bstt32-180 


putativeprotein 


G2-M-G1 


247 


Bstt3-22-160 


PotatoEST/hypothet 
icalprotein 


G1/S-S-G2 


248 


Bstt32-3-175 


unknown 


G2/M-M 


249 


Bstt32-3-325 


unknown protein 


G2/M-M-G1 


250 


Bstt 3-24-1 35 


unknown 


/a a i 1 ^ 

G2/M-M-G1 


251 


Bstt3-24-200 


- 


G2/M-M-G1 


252 


Bstt3-31-215 


unknownprotein 


G2/M-M-G1 


253 


Bstt 3-31 -330 


unknown 


G1/S-S-G2 


254 


Bstt33-1 -350 


unknown 


G2/M-M-G1 


255 


Bstt33-1-510 


putativeprotein 


G2/!vhM-G1 


256 


Bstt33-3-220 


unknown 


GZ/M-M-G1 


257 


6stt33-3-245 


unknownprotein 


G2/M-M-G1 


258 


3 stt 3-33-550 


hy potheticaiprotei n 


G1/S-S, M-G1 


259 


Bstt33-4-140 


putativeprotein 


S-G2 


260 


Bstt34-2-165 


unknown 


G1/S-S-G2 


261 


Bstt3-42-325 


hypotheticalprotein 


G2/M-M-G1 


262 


Bstt3-44-150 


unknown 


G2/M-M-G1 


263 


Bstt3-44-250 


unknown 


G2/M-M-G1 


264 


Bstt34-4-310 


unknown 


G2/M-M-G1 


265 


Bstt3-44-345 


hypotheticalprotein 


G2/M-M-G1 


266 


Bstt41 -2-340 




r— /a. a a a ^i 

G2/M-M-G1 


267 


Bstt41 -3-310 


unknown 


G2/M-M 


268 


Bstt4-21-185 




M-G1 


269 


Bstt42-1-370 




S-G2-G2/M 


270 


Bstt4-23-480 


unknown 


G2/M-M-G1 


271 


Bstt4-24-170 




G2/M-M-G1 


272 


Bstt43-265 


unknown 


G1/S-S-G2/M 


273 


Bstt43-3-350 


unknown 


G2/M-M-G1 


274 


Bstt4-33-390 


hypotheticalprotein 


G1/S-S;G2/M- 
M-G1 


275 


Bstt4- 34-280 




G2/M-M-G1 


276 


Bstt43-4-300 


unknownprotein 


/—^ rs /a a a a ^ 

G2/M-M-G1 


277 


Bstt43-4-330 


unknownprotein 


G2/M-M-G1 


278 


Bstt43-4-340 




G2/M-M-G1 




p e-t+A A A O Rf\ 
D STl*f*H»-^ OU 


nypOll lfc?UoolJJIlJLCll 1 




280 


Bstt4-44-400 


hypotheticalprotein 


G2/M-M-G1 


281 


MBc03-90 


unknown 


S-G2 


282 


MBC42-180 


unknown 


G2-M-G1 


283 


MBC43-210 


unknown 


G1/S-S-G2 



Table 5: overv 


ew group 4 sequences she 


SEQ 

ID 

NO 


Tag name 


Function 


Fase 


284 I 


3stc1 1-100 


unknown 


G2/S-G2-M 


285 I 


3stc1 -11-110 


unknown 


S 


286 I 


3stc1 -11-115 


unknown 


G1/S-S;G2/M-M-G1 


287 I 


?stc1 -11-120 




G1/S-S-G2 


288 


istd 1-1-125 


unknown 


G2/M-M-G1 


289 


6stc1 1-1-290 


NaD 


G1/S;G2/M-M-G1 


290 | 


5stc1 -12-155 




G2/S-G2-M 


291 ( 


istd -12-175 


unknown 


S 


292 \ 


Jstd -12-185 


unknown 


G2/M-M-G1 


293 i 


3stc1 1-3-116 


unknown 


S-G2 


294 [ 


Sstd 1-3-118 


unknown 


G2/M-M-G1 


295 \ 


3stc1 -13-120 




S 


296 I 


Sstd -13-130 




G1/S-S; G2/M-M-G1 


297 


pstd -13-132 


unknown 


M-G1 



SEQ 

ID 

NO 


Tag name 


Function 


Fase 


298 I 


3stc1 -13-142 


unknown 


G1/S-S 


299 \ 


Sstc 11-3-187 


unknown 


S-G2/S 


300 I 


3stc1 1-3-200 


unknown 


G1/S-S-G2/S 


301 ( 


istd 1-3-290 


unknown 


G2/S-G2-M-G1 


302 


3stc1 -14-100 


unknown 


P2/M-M 


303 \ 


istd -14-108 


unknown 


G2/M-M-G1 


304 E 


Bstd 1-4-130 


unknown 


G1/S-S-G2 


305 \ 


3stc1 1-4-135 


unknown 


G2/M-M-G1 


306 i 


3stc1 1-4-140 


unknown 


S-G2-M 


307 ( 


3stc1 -14-155 




G2/M-M 


308 I 


istd -14-165 




G2-G2/M 


309 I 


3stc1 -14-167 




G2-G2/M 


310 I 


3stc1 1-4-175 




G2/M-M-G1 


311 I 


istd 1-4-200 


unknown 


G1/S-S i 



32 



BNSDOC1D: <WO 03085 1 1 5A2_U> 



WO 03/085115 PCT/EP03/03703 



O IZ I 


SSIC I 4-rl IU I 


jnKnown « 






377 | 


i stc22-1-98 li 


jnknown 


S-G2-G2/M 


| 

j lo I 


5SIC I I-IOU I 


jnKnown 






378 I 


5stc2 2-2-110 


jnknown 


G2/M-M-G1 


O i A I 


5SIC1 Z-l-loU I 


JnKnown 






37Q 1 


Jstc2 -22-160 


unknown 


G1/S-S; G2-G2/M 


0 4C I 

olo t 


)s»+~,h o w o>in i 


jnknown 


\/\ riw 




JOU 1 




unknown 
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CLAIMS 

1. A method for identifying and validating plant genes/proteins as targets for agrochemicals, 
said method comprising the steps of: 

a Determining gene or protein expression profiles during a biological process of a plant or 
5 plant cell, said biological process being necessary for the growth and/or development 

and/or viability of the plant or plant cell; 
b Selecting genes or proteins having altered expression during said biological process, 
c. Cloning said selected gene or the nucleic acid encoding said protein in its full-length or 
partial form, 

1 o d Incorporating said nucleic acid in a vector designed for downregulation of express.on of 
said nucleic acid or the sequence homologous to said nucleic acid in a plant or plant 
cell. 



15 



2. The method according to claim 1, wherein said biological process cell division. 

3 The method according to claim 1 or 2, wherein said gene or protein expression profiling is 
based on nucleic acid or protein samples collected from a synchronized culture of div.dmg 
plant cells. 



20 4. The 



method according to claim 3,wherein said dividing plant cells are tobacco BY2 cells. 



5. The method according to any of claims 1 to 4, wherein the expression profiles are 
determined by means of micro-array, macro array or c-DNA-AFLP. 

25 6. The method according to any of claims 1 to 5. wherein said downregulation involves a 
viral-induced gene silencing mechanism. 

7 The method according to any of claim 1 to 6, wherein said downregulation involves the use 
of infectious DNA of virus is Tobacco Rattle Virus and wherein said plant is tobacco. 

30 f 
8. A method for screening candidate agrochemical compounds comprising the use of any of 

the methods according to claim 1 to 10. 

9 A method for screening candidate agrochemical compounds comprising the use of any 
35 one or more of SEQ ID NO 1 to 785 or a homologue, functional fragment or derivative 

thereof or one or more of the proteins corresponding to SEQ ID NO 1 to 785 or a 
homologue, functional fragment or derivative thereof . 
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10. A method for the production of an agrochemical resistant plant, comprising the use of any 
one or more of SEQ ID NO 1 to 785 or a homologue, functional fragment or derivative 
thereof or one or more of the proteins encoded by SEQ ID NO 1 to 785 or a homologue , 
functional fragment or derivative thereof. 

1 1 . An isolated nucleic acid identifiable by any of the methods according to claims 1 to 10, 

•12. An isolated nucleic acid, comprising at least part of a nucleic acid sequence chosen from 
the group of SEQ ID NO 1 to 785 a homologue, functional fragment or derivative thereof. 

13. Use of a gene nucleic acid according to claim 11 or 12 or the protein encoded by said 
isolated nucleic acid as a target for an agrochemical compound. 

14. Use of a nucleic acid or protein according to claim 13, wherein the agrochemical 
compound is a herbicide. 

15. A plant tolerant to an agrochemical, in which the expression level of one or more of the 
nucleic acids corresponding the SEQ ID NO 1 to 785 or the homologue, functional 
fragment or derivative thereof, is modulated. 

16. A harvestable part of a plant according to claim 15. 



030851 15A2 _l_> 



38 



WO 03/085115 



1/4 



PCT/EP03/03703 





— cluster 1 
cluster 5 

— —cluster 6 

— - cluster 10 




BNSDOCID: <WO 



030851 15A2_I_> 



WO 03/085115 



2/4 



PCT/EP03/03703 




FIGURE 



BNSDOCID: <WO 030851 1 5A2_I_> 



WO 03/085115 



3/4 



PCT/EP03/03703 




FIGURE 2 (continued) 



BNSDOCID: <WO 030851 1 5A2_L> 



WO 03/085115 



4/4 



PCT/EP03/03703 



Exp. Ill - 37 days after inoculation 
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Sequence Listing 

GROUP l " 
SEQIDNOl 

GAATTCACTAGTGATTGATGAGTCCTGAGTAAGGTGAGACGAGAAGCGACCTTCTGACCACA 
AGACTTGTCAGCCTGAGACAGGTATGATATCCATATACTGCGTATCTCATAAGTGACTCGTG" 
GATCGGATAAATGCTCAACCCATTTGCTAACATATCTGTCTTGCCTGTCAGGTTCCCAGGAT 
CACTACGCAGT"'AATCGAATTCCCGCGGCCTATAGTGAGTCGTATTAA 

SEQIDN02 

TGACTGCGTAC t aTCTCAAAGAAGTGGAAGTTACGAGTGCTCGAGATGTGATGCAGCAGCT 

TCTTCAGGGTi. ^CAAACAGAAAGGTAGCTGCAACCAACATGAATCGTGCTAGTAGCCGTT 

CACACAGTGT r ACATGTGTGATAGAGAGCAAATGGGAATCTCAAGGAGTAACTCACCAC 

CGGTTTGCTCC TACTCAGGACTCATCA 

SEQIDN03 

GNNATGCCCGJ .-TAAGCCGCCCCTANATACANTTNAAATGGTCCCGGANACCCTGGGNGAC 
AATNATNGAC: 4GCAGTGGTTGAAGNTTGACAATTCCTATT 

SEQIDN04 

CNCNATTNTN, ' AAGCCCGAAAAAGAAGAAGTAANGGAGGGAGAAGGCCTGAAGAGAATTT 
GCNGGATTTT r AGAGTTGATACAGM^JTTCTTCCAAACTGTTCCACCTTGGATTATGACAA 
GTAGTGTCACC \ACCAAGGTCGAGATGAGATACTGTTGCACATGTCTCAGCTGCGAAACTAT 
TGGCTCAAGC;. TTGAGTTGGCATCATATGA 

SEQIDM05 

ttttangnca! r^aatctcnctctaacggaccctngcatggcttgttcaaaataaatgcctc 
aggacaatacc -.cgttatgtaatggggagtgaactgcgtatatccgttctgctcntttatct 
gggcggngcctttgaagtttttgacaaactctntctggntctcacacttagggccacactca 
tcattactgt: :gtccaaaactcgtactcaactctttcatcgggatgtggaagcgcctctct 

CCAATCAAGGi .7TATG 
SEQIDN06 

CTTGGATGGTCNACCAGATTGAAGAACNCGAGAAAAAGCTGTTTTCTCATCCACTTCATAAG 
TCACAAAATGAACANCAGCCNTTGAGAATCNCAGCTG.TGNTATGTANNTTCGAAGACATTGG 
CTGAGGATGCTGCATGGAAGTTTGTGAAAGAGAAAGCCTATCGATATGGTTACGATAAACCC 
AGCAATGGTTATTGGCGGTTTGTTACAACCAATAC 

SEQIDN07 

CCCAAGATGAACAGTCAGTCAAGTCGATCTCATGCCATATTTACAATTACATTGGAACAAAA 
GAGAATGGCTAATTGCTCGACGAACGATGATGGTGATGACATATTATGTGCCAAGCTTCATT 
TGGTTGACCTTGCTGGTTCAGAGCGAGCAAAGCGAACTGGAGCTGATGAGATGCGTTTACGA 
NAGGGNATTCATATCAACAGGGGATTGCTTGCTCTTGGCAATGTAATAAGTGCCCTTGGTGA 
TGAAAAGAAGCGGAAAGAAGGNGCACACATCCCATACAGAGATAGCANGTTGACACGTNTCT 
TACAGGACTCACTTGGAGGAAACAGCANGACAGTTATGATTGCTTGTGTCAGTCCTGCTGAC 
ACCAATGCAGAGGAGACCC 
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SEQIDN08 

GCGGTTGATATGTGGTCTGTGGGATGTATTTTTGCCGAGATGGTTCGAAGGCAAGCCTTATT 
TCCTGGTGACTCTGAGTTTCAGCAACTGCTTCACATATTCAGGCTGTTAGGAACCCCAACTG 
AGAAGCAGTGGCCTGGAGTCAGTTCACTCCGCGACTGGCATGTTTATCCAAAATGGGAACCT 
CAGAACTTGGCCTCTGCTGTTCCAGCATTGGGTCCTGATGGCGTGGACCTCCTCACGAAAAT 
GCTCCAATATGATCCGGCAGATAGGATTTCAGCAAAAGCTGCACTTGATCATCCATACTTCG 
ATAGCTTGGACAAGTCTCAGTTTTGAGGTTGCTTCTACTTCTAAGATCAGCC 

SEQIDN09 

GCAGCNAGCNAAGNTNNGGTNGGGNACGCCAANNANNGNTGTGCCTTTGATGACGTCACCAG 
NTATCANTCTACATANAACGGAGGNCTTGCGANNGGCTTGGNTCATTCTACNGNTCTAGGAT 
TNTCAACTCTNNTCAATTCTTCNATAACTNACCTATTCTCCTGCAGCAATATGTGAGACGTA 
ACCTAGAATATTATTTGCCTTTATAGATATTGACTTATTCTGCTTGCATATTTTATCTGCAG 
CCGGTGGCATCTACCATCAATACATTGGCTGGTGCATTGTATAAGGTGTTTTGTGCTTCACC 
TGATCAAGCTAGGAAGGAGATGCGCGATGCATGCTTTGACTATTTGAGCCTTGGTGGAGTAT 
TCTCCACAGGACCTGTATCTTTGCTTTCTGGC 

SEQIDNO10 

GTCAGTGCTTGAGCTGAACCCGTTGCTTGGACTTGACAACTAGCATCTTCTCTTTGCATGCT 
GCCCTCATGTATTGCCAATGTAATTTCTCCTCTAGCAAACCATTATGTATTACAAACTATTA 
TTATGATTGTGAATAACTTGTGAAAAGTTCAATCAATCTGAAAGAAATAATCTCT 

SEQIDNOll 

CGTTGNTTGTTTCGGGAAATTGGAACAGCATTGGTGAAGGCACTTACGGNCAAGTGTACATG 
GNTAAAGAAATTAGAACAGGGGAAATTGTTGCNTTGAAGAAGATACGCATGGACAACGAANG 
AGAAGGGTTTCCAATANCTGCTATACGTGAAATCAAAATCTTGAAGAAGCTGCACCATGAAA 

ATGNGA 
SEQIDN012 

TAAAGGACCGNTTTTGTTTCGAGAAATTGGNNCAGATTGGTGAAGGCACTTACGGTCAAGTG 
TACATGGCTAAAGAAATTAGAACAGGGGAAATTGTTGCTTTGAAGAAGATACGCATGGACAA 
CGAAAGAGAAGGGTTTCCAATAACTGCTATACGTGAAATCAT^AATCTTGAAGAAGCTGCACC 
ATGAAAATGTGA 

SEQIDN013 

GGACGTTTGCATTTCGGATTNGNGCACGAGATGTTNATGATTTTAGGATTTATTTTAGTCAT 
CTTACTCGGNTGATGTTTATTCGTTTTTGTGACTTTTACTCGNGGGCGGNGGTGACCGCGTA 
CATGCTATTTATTTGATTTTTACTATGGNTATTGNTTATTGTTA 

SEQIDN014 

TTACNTTTACTGAGATNNTTATGATTTTAGGATTTATTTTAGCCATCTTACTCNGGTGATGT 
TTATTCGATTNTGTGACTTTTACTCGNGGGCGGTGGNGACCGCGNACATGCTATTTATTTGA 

TTTTTACTATGGTTATTGTTTATTGTTA 
SEQIDN015 

TATCAAATGGAGAAGTTATCAATATGAAAATAGCTGTCAAGCCACTTCAACTATTGCTAGGA 
AGCAGCAAACTGTGACGCGAGATAAACATGACACAGAACTCATTGCTAGGGGTNGNNATGAT 
NCTTGTGTAGTTCCCNAANCTGTNCCANTGNTTTAAGCAATGGTAGCCCTGGCGCTAGTGGA 

TNAGCTAATGGCTCATTATGCACAGTGTATGCTGTTCCCAA 

FIGURE 4 (continued) 



BNSDOCID: <WO ( 



030851 15A2_I_> 



WO 03/085115 



3/140 



PCT/EP03/03703 



SEQIDNOl 6 

TTGAAGAGTGGAAGTTACGAGTGCTCGAGATGTGATGCAGCAGCTTCTTCAGGGTGCTGCAA 
ACAGAAAGGTAGCTGCAACCAACATGAATCGTGCTAGTAGCCGTTCACACAGTGTTTTTACA 
TGTGTGATAGAGAGCAAATGGGAATCTCAAGGAGTAACTCACCACCGGTTTGCTCGTC 

SEQIDNOl 7 

GGGCCCGCCACCACCGCAACCACCTAGTTATTCCTCCGTCGAACACGTGTCTCACGAGAGTG 
AGAGTGAGAGCGTTCATCGTCAGCATGATCATCATCGTTTTCAACCACATGTGCCTTCATTC 
TTCCACCATGAGACCTCACCACATCCAGAGCTCATCGATAAGCCTTCATTTAGGGTTTATAC 
AAAGGCTGATCCCAATTACTCTCTCACTATCCGTGACGGCAAAGTCGTTCTTGCCTCTTCTG 
ATCCATCCGATCCTTTTCAACACTGGTATAAAGATGAGAAGTACAGCACTAAAGTGAAGGAT 
GAAGAGGGGTTTCCAAGCTTTGCTCTGG 

SEQIDNOl 8 

GTNTNTGTGGCCCACCTGCTGCNAGAGTGACACACAGGNATGATCTTAGAGCNGCNATTCAG 
AAGATGTTAGACACTCCTGNGCCATACTTGNTGGATGTGNTTGTAC.CTCATCAGGAACATGT 
TNTACCTATGATTCCCAGAGGCGGNGCTTTCAAAGATGTGATCACAGAGGGTGACGGGAGAA 
GNTCCTATTGANTTTGAGNNGCTACAGAGCTAGTTCTAGGCCTTGCATTATCTAAAATAAAC 

TTCTA 
SEQIDNOl 9 

GCAAGGAGTCAAGTGGATATTTTGGATGATGGTTATAGATGGAGGAAATACGGACAGAAGGC 
TGTCAAGAACAACAGATTCCCAAGAAGCTACTACCGATGCACGCATCAAGGATGTAACGTGA 
AGAAACAAGTACAAAGGCTGTCAAAGGATGAAGGAGTAGTAGTAACTACTTATGAAGGCATG 
CATTCACATCCCATTGAGAAGTCCACAGATAACTTTGAGCACATTTTGACTCAGATGCAAAT 
CTATGCTTCCTTTTGAAACGTCCATCACTTCAATGCCTAAGGCATGACACTCAATTAGTCAC 
TTGTAAAATAGTACTACAGTATATTGTGTACATGCGTTTTGAACCTAGATGCTATATTTTGA 
AATAAAACGCAACTTCATTAGGGAATTTAATTTGATCATTGTACAACTAAAAGTAATGTTGC 
TATTTTTTTGTTTTTATCAC.TTTGTTTTTGCCGGAGCCATGCTCTTCATTTTAACTCTTTTC 
TTTTAGAATTAACAAATAATTTCATGTTGGAGAAAGATACGTGCCAAAAAAAAAAAA 

SEQIDNO20 

TAATGGACACGGATCTGCACCAGATAATACGCTCTTCACAAGCACTGACAGAAGATCACTGC 
CAATACTTTCTCTATCAATTATTACGTGGACTCAAGTATGTACATTCAGCTAATGTCCTCCA 
CCGGGATCTGAAACCTAGCAACTTACTACTCAACGCAAACTGTGACCTCAAGATTTGTGATT 
TTGGGCTAGCTAGAACCACTTCAGAGGCGGATTTTATGACTGAGTATGTTGTCACCCGCTGG 

SEQIDN021 

TTTGCAGCCACCTTTNACATTTCGGTAGANGATNTGTCCATAACAAGCCTGACTTTTNTAAA 
GGAATTTACAGCTTCAATTGAAGCAAAACAAGTGGCTGCTCAAGAAGCTGAAAGAGCAAAGT 
TTGTTGTGGAAAAAGCTGAGCAAGATAAGCGAAGTGCTGTTATCAGAGCTCAGGGTGAGGCT 
AAGAGTGCCCAGCTTATTGGTCAAGCGATTGCCAATAATCCGGCATTTATCACACTCAGGAA 
AATCGAAGCAGCAAGAGAGATTGCCCAGACTATCTCACATGCAGCAAACAAGGTGTACTTGA 

GTGCCGATGATCTGTTGC 
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GROUP 2 
SEQIDN022 

ctgaaccctaacgcacacaacttcactctttgctcctccaaatc.tctctccaatgcaggatt 

tcatcggctccgttcgccgatctctggttttcaagcagtccggagacttcgataccggcgct 

gccggtgtcggcagcggattcggaggcttcgttgagaaactaggttcgagcattcgcaaatc 

gagtattggaatcttctcgaaagctcatgttcctgctcttccgtctatttctaaagctgagc 

tgcccgcgaaggctcggaaagatgacactccgccaatccggtggaggaaaggtgaaatgatt 

ggatgtggtgcttttggtagggtttatatggggatgaatgttgattctggagagttactcgc 

tataaaggaggtttcqattgcgatgaatggtgcttcgagagagcgagcacaagctcatgtta 

gagagcttgagaaaqaagtgaatctattgaagaatctctcccatcccaacatagtgagatat 

ttgggaactgcaaaaqagqcaggatcattaaatatattgttggaatttgttcctggtggctc 

aatctcgtcacttttqciaaaaatttggatccttccctgaatctgttataagaatgtacacca 

agcaattgttattacgqrtggaatacttgcataagaatgggattatgcacagagatattaag 

ggagcaaacatacttgttgacaataaaggttgcattaaacttgctgatttcggtgcatccaa 

gaaggttgttgaattggctactatgactggtgccaagtcaatgaagggtactccatactgga 

tggctcccgaagtcattctgcagactggccatagcttctctgctgacatatggagtgtcgga 

tgcactattatcgaaatgqctacaggaaaacctccttggagccagcagtatcaggaggttgc 

tgctctcttccatatagggacaaccaaatcccatccccccatcccagagcatctttctgctg 

aatcaaaggacttcctattaaaatgtttgcagaaggaaccgcacctgaggcattctgcatca 

aatttgcttcagcatccatttgttacagcagaacatcaggaagctcgcccttttcttcgctc 

atcctttatgggaaaccccgaaaacatggcggcgcaaaggatggatgttaggacctcaatca 

ttcctgatatgagagcttcctgcaatggtttgaaagatgtttgtggtgttagcgctgtgagg 

tgctccactgtatatcccgagaattccttagggaaagagtcactctggaaactaggaaactc 

tgatgatgacatgtgccagatggataatgatgattttatgtttggtgcatctgtgaaatgca 

gttcagatttgcattctcctgctaattataagagttttaatcctatgtgtgaacctgataac 

gattggccatgcaaatttgatgaaagtcccgagttgacgaaaagtcaagcaaacctgcatta 

tgatcaagcaactattaagcccactaataaccccatcatgtcatacaaggaggatcttgctt 

tcacatttccaagtgggcaatctgcagccgaggatgatgatgaattgacagagtctaaaatt 

agggcattccttgatgaaaaggcaatggacttgaagaagctgcaaacaccactatatgaagg 

attctacaattccttgaatgtttccagcacaccgagtcccgttggcactgggaacaaggaaa 

atgttccaagtaacataaacttaccaccaaaaagcaggtcaccaaaacgtatgcttagcaga 

aggctctctactgccattgaaggtgcttgtgctcccagcccagtgactcattccaagcgaat 

atcaaatattggtggcctaaatggtgaagctattcaggaagctcagttgccgaggcataatg 

aatggaaagatcttcttggttctcaacgtgaagcagttaattcaagcttctctgagaggcaa 

agaaggtggaaagaagagcttgatgaagagttgcaaaggaaacgagagattatgcgtcaggc 

agtcaacttatcaccaccaaaggatccaattctaaatcgatgtagaagtaaatcaaggtttg 

catctcctggaagataaatgtatgtacttgtgtccctaaactaaagtcagtttgaagaatat 

aattaatgatcctgcaaccccagaacagagagttagatgtcttgagcaggtatacgaacgtg 

aggttttcttgacccgttactacaggaatatcagcgcttgtcagatagagtgagctgttact 

acaggaatatctgtcaacctgttaatcatattataaaatgccaataatttgcgttgtattcg 

ttttgatcattctcctgagagcattgtaagaaaaatgcaggcctttttataacctatataag 

tgctctctcatggtagttgccaatattaaaacgcagagaaaagtcgagttctcatctgctga 

attgtttgtaaaatgtgatatattaatgtatttaccgtcttacaacc 
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SEQIDN023 

ccacgcgtccgtgatatgggatgtcacattgatggatttattgctgtagttggacatacaca 
tgttcttcacgaaggaccagttactggtagacctgctgacgtcattgcagctgctaatacag 
ccgctgaagttgctttgaggcttgtgagaccaggaaagaagaactcggatgtaacagaagct 
attcagaaagttgctgctgcctatgactgcaagattgttgagggtgtcttgagccatcaaat 
gaagcaatttgttattgatggaaacaaagttgtattgagtgtgaccaatcctgaaacgagag 
tagatgaagcagaattcgaggagaatgaggtttactccattgatattgtgacaagcactggt 
gaaggaaagccaaagttgttggatgagaaacaaacaactatctacaagagagccgtggacaa 
gagctataacctgaagatgaaagcatcaaggtttatcttcagtgaaatcagtcagaagttcc 
ctatcatgccatttaccgcaagggatttggaggagaagagggctcgtctgggcctagttgag 
tgtgttaaccatgagcttttgcagccatatcctgttctacatgagaaacctggtgatttggt 
tgctcacattaaattcacagtgctgttaatgcctaatgggtcggataggattacatctcatg 
ctctccaggagctgaagcctgcaaagtcgatagagaatgaacccgaaatcaaagcctggctt 
gcccttcccgttaagaccaagaagaaaggcggtgggaagaaaaagaaagggaaaaaaggtga 
caagacagaagactcatctcaagctgagccaacggaaggatagagaaatggtttcaaatctt 
gataaatagcaattttgaggtgcttgatcgatcaacttcactgaaactattggttcactgtt 
ggtcggcactttcagctgcctttgttcttccttgtggggctttgctatacaagggacagaca 
gttattgtcctcttgtactgtcatgttaaattactcagttttccaatgctattcaacatgct 
ctcaatcggtctttaaaaaaaaaaaaaaaag 

SEQIDN024 

ccacgcgtccgcaaaaccctagctcaaatcccgtttgcctccattttcattccatcaacaaa 
aacctaagtttatactcagcttgagacatttgataactatgtcggacgacgagagagaagag 
aaagagttggatctgacaagtcctgaggttgttactaagtacaaaaatgctgctgaaattgt 
taacaaggctctgcagttggtggtgtctgaatgcaagccaaaagcaaagatagttgatcttt 
gtgaaaaaggagatgccttcatcaaagagcaaactgggaatatgtacaagaatgtgaagaag 
aaaatcgagaggggtgtggcatttcccacctgcatttcagttaacaataccgtgtgccattt 
ttctccactgtctagtgacgagacagtattggaagaaggtgatatggtgaagattgatatgg 
ggtgtcatatagatggctttattgctgtagttggtcatacacatgtgctccaggaaggacca 
gttactggtagagcagctgacgttgttgcggctgctaatacagctgctgaagttgccctgag 
gcttgtgagaccaggaaggaagaactcggatgtaacagaagctattcagaaagttgctgcgg 
catatgactgcaagattgttgagggtgttttaagccatcaaatgaagcagtttgtgattgac 
ggaaacaaagttgtgttaagtgtgtccaatcctgaaacgagagtagatgatgcagaatttga 
ggagaatgaggtctattcaattgacattgtaacaagcactggtgaaggaaagccaaaattgt 
tggatgagaaacaaacaaccatctacaagagagctgtagataaaagctacaacctgaagatg 
aaagcgtcgaggtttattttcagtgaaatcagtcagaagtttcctgtcatgccatttacagc 
aagggatttggaggagaagagagctcgtttgggactcgttgaatgtgttaaccatgagcttt 
tgcagccctatcctgttctacatgagaaacctggtgatttggttgctcacataaaattcaca 
gtgctgttgatgcctaacgggtcagataggatcacaactcatactctccaggagctgaaacc 
tgctaagacaatagaggatgaacctgaaatcaagacctggttagcccttcccgtaaaaagca 
agaaaaaaggcggcgggaagaaaaagaaagcgaagaaaggtgagaagacagaagactcatcc 
caagctgaaccaatggaaggagaatcaaatggtgctgaatcttgatatgttgctagaacttt 
gatttgattcaattccaagaactatttgttgattgttagttaaatgtgggatattgaggtag 
ttgtggatctttctttgcggcattttgcaatacaagaatggcatggacagttgttgtccttg 
tcttgacacatttgtcatgctggaattattaagtggggtttccaatgctataatgtcatgtg 

tatcaaaaaaaaaaaaaaaagggcggcaactctagagtatca 
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SEQIDN025 

ccacgcgtccgcttggggattagcaggttgtcgacaaagaaaattcatttgtttcctacgat 

cacacaagtcgtggttgttgcagatccgctttcgctaaggggaaactcaaaagcccagttcg 

tgtagttcatccaaagatgagttccagcaaaagggttgggaagtcttctaattcatcaggaa 

agcagaaagctatatgcgaaacaactactacaccaacggttgat'gatataaatgtaggcgta 

gaagatatggggttgaactccgatcaaaatgatggatggatagtgtgttctagaaagtccaa 

gaacaagggtggaagcagcagtgctggaatgaagcaatggatttctcagaatcccactccaa 

aagccaaactgggaatgcgtaacaatattgttggatcatcaggacaggggtctaggaataac 

tggtccacacctaattatcatcctcgaaaacctgctggcagagaatgctacacaccgacacc 

cgctgcagttcctcctgccctgaagaatggttgggattggtcatctgtcgctcgttccaatg 

aggaccatgatacttattcccctgtcgctgatgtcaaggcttcctgtgaacatgatggagag 

gataatgaatcggatttgcctgatgatgacagtgatgatgagcttccgagtgatgacgactt 

tgatgatcactcggatgtaaatgaaatgagtcatgaggtactcaaggaaagtcgttggttca 

agaaccttttcaaatgtcttgacagtttgactgtcactgagattaatgatccggaaagacag 

tggcactgccctgcatgcaaaggtggtccgggtgcaattgagtggtttccagggatacagtc 

agtgatgaaccacgcaaaaacgaaaggatttaggatgaaattacacagacaacttgctcaac 

ttttggaggaagagctgcgtcggaggggaacttctgttgtacctccaggtcaagtgtatgga 

agatggggtggcggtgaatatgaagataaggaaatagtgtggccaccgaccgcgattatcgt 

gaacacagtgcttgagaaagatgaaaatgacaagtggattggaatgggaaatcaggagctgc 

gtgattatttcagctcttatgctgctgtcaaggcagcgcgaagctcatatggtccacaaggc 

catcgtggtatcagtgtgttgatttttgaggccactcccgtgggatacatggaggctgtact 

tctcagtgagcagttttctgaaaaaggaagtgatagagatgcatgggaacacaatccagttc 

tcttttatcctgggggaaaacgtaagctttatggttacatggcagagaaaagagacatggac 

aactttaaccggcattcacatgggaaatcaaggctgaagttcgagatgaggtcatataaaga 

aactgtttcgaatccagcgatgcagatgtcggaggataatcaacagctcatatggttcaaga 

accaagcctctaagcaccaaaagcgggctaaagctactgaagagtctctaagactggtgagt 

gaaaagcaccgtcagacagtcgaagagaacaagattgtcagactgagaactaagatgcacca 

tgaacggaacaaggaagagatggaatatctagagcagttttttaatgatcagttgaaaatga 

tttatgatgccaggactgctgaggaggacaagtttgaaaagatacagcaggaacagcgtgag 

atgatctatcaatctaatgcaactatttcctcggctgaggatcatcgactcagggcagagaa 

agttgcgaaatttatcaaacttcaggacaaggatatggaagaatttgtggaagagagggata 

atctgataagagctcatgaagatagggtaggttcaatgagacgcaaatacttgctgcaatac 

tcggaagaggcagttgcacttgagaagaattttgatctcgaactggctaagctgatggagaa 

gtactcatcaaagcaatctgagcaggtcaacagcagtgatgccgtgtgaccctatagtaata 

ctattcaagcgccgttttagctttaaatttctgtgaacttgggattcttcactgacttttat 

aatcctggtctgtccatgtgttttgatgatgctaaagaaatgattctaatagttatattata 

tcctaaaacatatggcttgaactatttgttctagaaaaaaaaaaaaaaaaaaagggcggcc 
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SEQIDN026 

ccacgcgtccggccatggtagcaaaacagttagctgatgacgaaccacaaaaaaccctcaag 

gattcaccaaagtttgaatccaaatcccataagaaaaaacataagagaaagctcgaagaccc 

tgaacctgaagaagttactgttatagagtccaagaaagagaaaaagaagaagaaaaagcaga 

aacagaaccaagaacaagaagggtctattgtaaacagtgaaaatcttagtgggtctaatggc 

aaggttgaaactattaatgggtcagctgagttctctgaaaaaagtagtacaaatgtggtggt 

aactggtaaggatgctaatgagtcaaagtacaaagctttagcaaaatttgtggattcagggc 

ttccaagtgatgtgttagattgttgcaagaattttgagaaaccatcaccaattcaatcacat 

tcatggccttttcttttagatggccgtgatttcattggaattgccaaaactgggtcaggtaa 

gactttggcttttggtattccggctattatgcatgtcctgagcaagagaaagagtaaaaagt 

ctaagaatccqctttgcctcgtgctttcgcctacaagggagctagctcaacaaatatcagat 

gttctctgcgatgctgggaagcctactggtgtgcagtcagtttgtctatatggtggagtcga 

taagcatcatcaaaaagcttctcttaaatctggtgtggatattgttattggaacccctggtc 

gtttgcaggatatgatggaaatgggagcatgcaacttaaaagcggtttcttttgtggtgcta 

gatgaagctgatcggatgctcgatttaggttttgaacctgctgtccgtgccattttgagcca ^ 

aacatgctctgttcgacaatcggttatgttcagtgctacatggcctccggctgttcatcaat 

tagctcaagaattcatggatcctcatccaatcaaggtagttgtaggttcagaagatttggct 

gccaaccatgatgtcatgcaaattgtcgaggtcttggaagatcgagcccgtgatgagcgttt 

acagtgcttgctggaaaaataccacaagtttagaaagaacagagtattggtttttgttttgt 

acaagaaggaagcatctcgggttgaaattatgctacagaaaaggggttggaaagttgtgtcc 

attagcggtgacaagcaacaacatgctcgtactaaggcgttgtcactctttaaggatggaag 

ctgtcctttaatgatagctactgatgtagctgctcgaggtctggatatcccagatgttgaag 

ttgtgataaattatagttttcctttgacaacagaggattatgttcatagaattggaagaact 

gggcgagctggtaaaaaaggtgtagctcatacattcttcactaaggacaacaagggactttc 

tggggagttgataaatgttctcagagaggctggacaggttgtgccagctgcccttcttaatt 

ttggaacccatgtaaagaaaaaggaatcgaagctctatggtgctcattttagagaaatagat 

gcaaatgctccaaaggctacaaaaataaaatttgacaattctgatgaggaagattgagaagc 

aatatcattattaccaaagcaacacaactccattgaattggctcatcatcctgacattccgt 

gcaatcatttggcggatacatgtagaagtggattactgcgggaagaatgcaagagatatctc 

actgctcatgtatatggtaattgaagcttaaatctattggcgcttcaacctgtcatagataa 

tgagtttgaaatactattgtgtttttgtaccttaatattcttttcacccatacagttggctt ^ 

agtaaggtttttctaggatccaaatgtagtaatacacttattataatttgcccttttaagtg 

atgtatgtatgattgcaccttccaaattactgcacttggcaaaaggtggaaaaatattcgaa 

aatgagattcaaaactggttcatgcaaaaaaaaaaaaaaagggcggcc 
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ccacgcgtccggtggaacgccacgttgccattctctcttcggtgacaagcttcaaacgccag 

gcgtctctctcgtatctgagtgctggtgttttttcttctcaggttcaagttccggcagttaa 

tgcggcaaattcaaatttagatgttatgcaatagctgcgtaagatttgtgttttttcaagca 

gcacatattgatacacacattgtgcaaaggcaatttatcaaactcattaaaagtttgaatgc 

aactcgtagcataagttattcttggaatagtgtatatacagcaaggagaacgatgatggtgg 

atactggagcaactgctaaaggaggacctgtcgttgatgtttcaccggagaaggatgataat 

aatggtggtttcgctagcggaggatggaagagtgaagatggaagactgagttgtggttattc 

aaactttagagggaaaagagccaccatggaggatttttatgacattaaaacttggcaaagtt 

gatggacaaacaggtagcttatttgggatatttgatggccatggtgactctcgcacagctga 

gtttctgaagaaacatctctttgagaatctaatgaaacatccagagttcccaacgaacgcca 

agctasccataagtgaaacatatcaacaaacagacatggacttcttagattctgaaaaagat 

accttccgaqatgatqgttccactgcttcaaGagcagttctagttggtaaccatctctatgt 

tgccaatgttggagattcacggactataatatcgaagggcggaaaagcaattgctctttctg 

aggatcataagcccaatcgaactgatgagaggaagagaattgaaagtgccggaggtgttgtg 

atgtggqctggtacctggagagttggtggtgtattagcaatgcacgtgcttttggcaaccgt 

atgttgaagcaatttgttgtggctgaacctgagattcaggatcaagagattgatgaggaatt 

agaactactcgtgcttgccagcgatgggctctgggatgttgtaccaaatgaggatgctattt 

cacttgcacaagcagaagaagaaccagaagcagctgctaggaagctaacagaaactgcattt 

actcggggtagtgctgacaatattacctgcatagtggtgaagtttcaccacaagaaggttga 

accagaggggagccagcaaggttgaagaatttgttgatgctgcatctgccttttcctggtgg 

aaggctgcttcaatgatgccggtgcaagttgctgacgatagcatcacaggggctgtcatttt 

ttcattcatttctttgcattgtttttccccgtcatcctgtttaactgttgtatttaaggtgt 

ctgcgtttgtgcgtctgctttctccttttctgtagaggtattgtctggataaactttactgt 

gaaacgtagttaaaaggttaaaaaaaaaaaaaaaag 
SEQIDN028 

ccacgcgtccggaagaaatggttgaattcatggaaaaggtcttcaactccctcggctcagaa 
gaactcaccgtggaggaacgaaacctcctctccgtcgcgtacaagaacgtgatcggagcgcg 
tagggcatcgtggcgtattatctcatcgattgagcaaaaggaagagtccagagggaacgagg 
aacacgtaaactctatccgcgagtacagatctaagattgagaatgaactctctaagatctgt 
gatggcattctgaaattgctcgatgcaaagcttatcccttctgcagcatctggtgattctaa 
ggtgttttacctgaaaatgaaaggagattaccaccgctatttggctgagttcaagaccggtg 
ctgaacgtaaggaggctgctgagagtactctcactgcctacaaagctgctcaggatattgca 
actactgagcttgccccaacacatcccatccgacttggactggctcttaacttctctgtgtt 
ttactatgagatcttgaactctcctgaccgcgcttgcaatcttgctaaacaggcctttgatg 
aagcaattgctgagttggacacactgggcgaggagtcttacaaggatagcactttgatcatg 
caacttcttcgtgacaatctcactctctggacatctgatatgcaggatgatggggctgatga 
aatcaaggaagatcccaaacctgatgaagccaaaaattgaaggaattgaaactctctaattt 
gcttttcacttcttcctggttgtttttattggaagaagctgattatcgtaatttcctttact 
attatggttttccgctagggggttgtcttattggaaatgaacaacttttaatattgatgttt 
cagaagttccatctttaatttaatgtggtttttctggtggtaaaaaaaaaaaaaaag 
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ccacgcgtccgctactgtttcttcatcgctatgccgtcagttccgcttcctacttgactgaa 

tctgccgccatggaaggggatatctacacatctaactgcagaattgttacataaacactgat 

gggaatgcagaagagatattctcaaatattcaaggtagcattttgaaaaatgtcagatgata 

tggtcattcattttgcatccaattcttcaaaccaatcagaccagtctctgcccacaaagatt 

gctaaacttgaggcaagaatggtgggcaaagcctcatctacatctacatcccgagctacttc 

ctggtctgccccagccaagtttgggcttgggcctgggcctgctgacaatgttgctgagcttg 

ctgtctctagtgattctgatgatgatgatgataatggaagggaatttctcatacaagcaaac 

actcagaagcggcgcaaactcgaggatgacaacagctcaacttcatttgaacatgtggagac 

agcagctgatactgtgaaaaagatagtagacaatacagacacaagcaaagtgggttcagatg 

tgaatagacggaaacaaagccgtgtcaagggacaaactaattctggtagaggacgtggttcc 

cgagttagtgatcagaccaagtcacaagcagtttctgtgtcaaatggtcagctcgagaactc 

ttaccagaaggatggtttgccaaaagagcaaattgggcacgatcgacagactgtattcgaag 

aggagatcacttctttacgggcaaaagttgtggctttggaggaagagcttaagaaatcccgt 

caagaggcatcagattatcaacatcagtgtcaacagctggaaaaggaattgaaggatcttaa 

agattatgagcagcagacaaagccaaagagaacgaaaataatatctgaattgttaatatctg 

tttcaaaagctgagaggcaagaggcacgaatgaaagtgcgacaggaatctttgagactgggc 

aacgtgggagtaatcagagctggaaccattatttctgaggcctgggaagatgggcaagcact 

aaaggacctcaatgctcagcttagaaacttattagaaactaaagaagctattgaa.cggcagc 

gtaaattgctcaagaaacgacaaccagataaaagtgatggaggagatgtggagggaggtttg 

caggaagaagattctctcattcaggatgagatctacaaatctcgtttagccagcatcaaacg 

tgaggaagatgtgataatgcgtgagagggaccgatatgaactagagaaaggaaggctaattc 

gtgaaatgaaacgcatacgtgatgaagatggttctcattttaacaattttcagattttgaac 

caccgatatgccctcttaaaccttcttggaaaaggaggatttagtgaggtgtacaaggcttt 

cgacttggtagaccatagatatgttgcatgtaagctacatggactaaacgctcagtggagtg 

aagagaagaagcaaagttatatacggcatgcaatcagggagtacaacatccacaagactttg 

gtgcaccatcacattgtgcggctttgggacatttttgagatagaccaaaacaccttttgcac 

tatcttggagtactgtagtggaaaggaccttgatgcagttctcaaagcaacacctgtgttgc 

cagaaagagaagcaagaatcatcattgtgcagatttttcaaggccttgtctacttgaataag 

aagtcacagaagatcatccattatgatttgaagccaggcaatgttttatttgatgagtttgg 

cattgctaaggtcactgattttggccttagcaagatagtggaggatgatgttggatcccagg 

ggatggagctaacatcccagggagctggaacgtactggtatctacctcctgaatgctttgag 

ctaagcaagacacctcttatatcctcaaaggttgatgtctggtcagctggtattttgttgta 

ccaaatgctgtttggcaaacgtccctttgggcatgaccagtcacaagaaagaatactaaggg 

aggacacaattattaaagcaagaaaggttgaattccctacacgaccagctgtctctaatgag 

gcaaaggagttcattcgtcgttgtttaacatataatcaagcagataggccagatgttttaag 

tattgctcaagacccttacttgacatactcaaagaaatgataggaggatgttaatcccaact 

acttggacagagggtattgggacgaggattggtgctcaaaggaattttgtatagttgtaaag 

ccatgtaattttttgtccctgtaccttcgactagagtggggcggctcaaggggagctttgct 

ttaggccccaaaattttgggggcatttgcatctatacccagtttttgggttaacttttaact 

tatatccgcattgcaaaaaaaattgcaagcatacctacttttcgggtaacttcagacattcg 

ggtctgaagtagcaaaaatttatgtctgaagtttgaacttcagaatgttttgcctgaagtgt 

agtaaaacttcagatatttttgcctgaagtttggcctgacttgcaaagtcaatcacgcaaac 

ttcagttcatagtgcaatggcaaacttcagctcaataaaattacagcatgttttggctgaag 

tttttgttttgtaattgttgaacttcagcattttaggaactgaagtttgttttgtaattgct 

gaactttagcattctaggggtgaagtttgttttgtatttgctgaacttcagcattctaggag 

ttgaaatttttgtttatatttgctgaacttcagcattcttagagctgaagttctaagtctgc 

acacggaaatgaggaagataacc 
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ccacgcgtccgacgaaatccaaccgtcgaatctcaggcaacaggcggcagctcatttcaccg 

ctgtaacaaaaattcgagagaatggcaatggtagatgagccattgtaccccatagccgtgtt 

aatagatgaacttaagaacgatgatatacaattacggttgaattcaattaggaggttatcga 

ctattgcacgtgcccttggtgaggaaagaactcgaaaggaattgatcccttttttgagtgaa 

aacaatgatgatgatgatgaggtgttattggcaatggctgaagagcttggtgtgtttatccc 

ttatgttggaggtgtagagcatgctcatgttttgctcccgccgttggagacgctttgtactg 

ttgaggagacctgtgtgagggataaagctgttgaatcgttgtgtaggattggatctcagatg 

agggagagtgatttggttgattggttcgtccctcttgtgaagaggctggcagctggtgaatg 

gttcacagctagagtttctgcctgtggactctttcatattgcttactcaagtgccccagaga 

tgttgaaggcagaacttcggtctatttacagtcaattgtgtcaagacgacatgcctatggtg 

cgaagatcagctgccacaaacttggggaagtttgctgctactgttgaatctacttacctcaa 

gagtgacatcatgtcaatatttgatgatcttacacaggatgatcaggattctgtacgcttat 

tagctgttgagggctgtgctgcacttggcaagctgttggagccccaggattgtgttgcacac 

atcctgcctgtcattgtcaacttctctcaggacaagtcttggcgcgtccgctacatggttgc 

taaccagttgtacgaactatgtgaagctgtagggcctgagcccactaggacggatttggtgc 

ctgcctatgtccgtttgcttcgagataatgaagctgaagttcgcatagctgctgcagggaaa 

gtcaccaaattctgtcggattcttagtcccgagcttgctattcagcatattcttccctgtgt 

gaaggaattatcatcagactcttcacagcatgtcagatctgctttggcttctgttataatgg 

ggatggctcctgttttgggaaaggatgcaaccattgagcatcttcttccaatatttctttcc 

cttctgaaggacgagtttcctgatgtgcgcctgaacatcattagcaagcttgatcaagtcaa 

tcaggtgattggaattgatttattatcccaatctttgttgccagctattgttgagctagcag 

aggacaggcattggcgagtccgtcttgcaataatagaatacatacctctattggcaagtcaa 

ttgggcataggattttttgatgataagcttggtgccctttgtatgcaatggttacaggacaa 

ggtttattcaatcagagatgctgctgctaataacctaaagcgtcttgcagaagaatttggtc 

cagagtgggcaatgcagcatataattcctcaggtcttggatatgactaccagtccacattat 

ttgtatagaatgacaattcttagagcaatttcattgcttgcacctgtaatgggctctgaaat 

aacttgttctaaattgctgcctgtggttattactgcaacaaaggatagagtgcccaacatta 

aatttaatgtggcaaaggtgttgcaatcccttatacctattgttgaccactcggtggtggag 

aaaaccattcgccctagtttagtagagctagctgaagaccctgatgttgatgttcgcttt-ta 

tgccaatcaagcacttcagtcaattgataacgtcatgatgtcaggctagagaatataactgt 

ggtgagagtactacaaatctctcttcaaatccctctttggtaggattttgctctcacacgaa 

gacgcaaaagagaaaatgtgcaagcaaaatgcattctgttgagcttggagtcgtatattgtt 

actaattcttttgtaggatttgacattcaagatgctgtgacactaatgaacaccgagtgttt 

tttcatgtaaagttactgccgtactatttagatctgctaagctcatgtatttgcttttgtta 

gtgtacttttttggtgt-ttgaacttacaactttttacctgcgttattctagcagatttgttg 

cgtttacattagcgtttgcgtttcttcctagccgatgttatgtttgagcagtgcccccgcca 

ccctctctttttctcaggtcttatgctttctatgtgttttttcatgccgatagaatgtatgt 

ggaacttttagtacttattattttttatgttgtatttgttggcttgagatgagcaacataaa 

■fcaataagaaactggg 
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ctgtacaaaaaagcggctggtaccggtccggaattcccgggatatcgtcgacccacgcgtcc 
gaggagattgagctgagctgactcaatgtttccgagattgattcaaccacaaggggaagatg 
aatataatatgaatgttgggattcatcatactcataatattaatggagatccttgccttgtg 
ctgacgtcagatccgaagcctcgacttcgttggactgctgaccttcacgaacgcttcgttga 
tgctgttactcagcttggcggtcccagcaaagctacgccaaaggcgataatgcggacaatgg 
gtgtcaagggactgaccctcttccacctaaagagtcaccttcagaaatacagactaggactt 
acagctacatattcattagagagcccttgttctggtggtactcctcagcagttgccggcatc 
ggacttgaatgaaggttatgaagtcaaggaggcattgagagctcagatggaagtgcaaagta 
aattgcacctgcaagttgaagctgagaagcacttgcaaattcggcaggatgctgaacaaagg 
tatattagcatgctggagaaggcctgtaaaatgcttgctgatcaattcattggtggtgtagt 
tactgaaaatqatcaagagacttgccaaggattaggaacaaggacacaagttagccctcttt 
gtaatccacttggattatgcccctcggaatctgctgatcttgttggaatccatggtccagaa 
gaagtttcccccagaatccatccacaattcaccgattgttccactgaaagctgcttaacttc 
gcatgagagtcctgctggacttcccctagaaggaacttcacctggaggaagaaaacgagggc 
cgaatggagattcaacacatgcatcagttgttggggtgaagcagatatgatatcgtcaggtg 
ttcgtctgcttcaagttaatcgctttgggattactagctctaatgttcaaaatgtctcttct 
taagagattagtgctgagtttatctacagccattgattctcaaactgcatattgcggtttct 
gggaatactgatgggccttggacttgtcaagttgtaaatgcaagctgatgactttctaactc 
taactgcgccccctgaacattaaatcctaaaaaaggaagaaaattgagatgcgag 

SEQIDN032 

agcggctggtaccggtccggaattcccgggatatcgtcgacccacgcgtccgaaagaagaga 

aaaagatgggtgctgacaaagggaagaagcaaaaagtggaggaagagaacaacaccattgat 

ggtgagctcgttttttccattgaaaaattgcaagaaatacaagacgagctcgagaagatcaa 

tgaggaagcaagtgataaagtattggaagtggaacagaagtacaatgagatccgcaagcctg 

tctatgacaaacgaaacgacatcattaaagctatcccggacttctggttgactgcttttttg 

agtcatcctgtcctaggtgaacttctaactgaagaagaccaaaagatcttcaagtttctaag 

ttctattgaagttgaagactctaaagatgtgaagtcgggctactcgataacctttaacttca 

atgcgaatccttattttgaaaatacaaagctcacaaagacctataccttccttgaagatgga 

cccacaaagatttctgctacaacaataaaatggaaagaaggcatgggcattcctaatggatt 

tgcacatgagaagaaaggaaacaagcgatctcatgctgaggaaagcttcttcacatggttca 

gtgaagtcaatcaaaaagatgaggatgaggatgaggccctagagattcaggatgaggtcgct 

gacataattaaggatgacttgtggGcgaaccctctcacctattttaacaacgagcctgatga 

agaagattttgatggtgacgagggaaaggacagtgaaggctctgaagacgaagaggaagaag 

aagaggaggatgaggatggtgatgaagaatgaaggcagtaaactgttcaagacccctatttt 

gggatctcgtcttcagcggttttaatcatcagggtttaatgtctgtaaagaggctttgaatg 

ttgccaaagaacagaataactgtggtgactataccttttcttctcttgtatggttataactt 

ataagcaaaatatctaattccggaggttccaaaatgttttcattaggctagttcgattaatg 

aagtgtttgtctggcaaaaactgataatgttaggttattgagttatg 
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ccacgcgtccgcccacgcgtccgggcagctcatttttaccgccgtaacaaaaactcgagaga 

atggcaatggtagatgagccattgtacccaatagccgtgttaatagacgaacttaagaacga 

cgatatacaattgaggttgaattcaattaggaggttatcgactattgcacgtgcactcggtg 

aggaaagaactcgaaaggaattgatcccctttttgagtgaaaacaatgatgatgatgatgag 

gtgttattggcaatggctgaagagcttggtgtgtttattccttatgttggaggtgtagagca 

tgctcatgtcttgctcccgcctttggagacgctttgtactgttgaggagacttgtgtgaggg 

ataaggcggtggaatcgttttgtagaattggatctcagatgagggagagtgatttggttgat 

tggtttgtccctctcgtgaagaggcttgcagccggtgaatggttcactgctagggtttcagc 

ttgtggactctttcatattgcttactcaagtgccccagagatgttgaaggcagaacttcggt 

cgatttacagtcaattgtgtcaagacgacatgcctatggtgcgaaggtcggctgcgacaaac 

ttggggaagtttgctgctaccgttgaatctgcttacctcaagagtgatatcatgtcaatatt 

tgatgatcttacacaggatgatcacgattctgtacgcttattagctgttgagggctgtgctg 

cacttggcaagctgctggaaccacaggactgtgtggcacatatcctgcctgtcattgtcaac 

ttctctcaggacaagtcttggcgcgtgcgatacatggttgctaaccagttgtatgaactatg 

tgaagctgtagggcctgagcccactaggacggatttggtgcctgcctatgtccgtttgcttc 

gagataatgaagctgaagttcgcatagctgctgcaggaaaagtcaccaaattctgtcggatt 

cttagtcccgagctagctattcagcatattcttccctgtgtgaaggaattatcatcagactc 

ttcacagcatgtcagatctgctttggcttctgttataatggggatggctcctgttttgggaa 

aggatgcaaccattgaacatcttcttccaatatttctttcccttctgaaggacgagtttcct 

gatgtgcgcttgaacatcattagcaagcttgatcaagtcaatcaggtgattgggattgattt 

attatcccaatctctattaccagctattgttgagctggcagaggacaggcattggcgjagtcc 

gtcttgcaataatagaatacatacccctgttggcaagtcaattgggcataggattttttgat 

gataagcttggtgctctttgtatgcaatggttacaggacaaggtttattcaatcagagatgc 

tgctgctaataacttaaagcgtcttgcagaggaatttggtccagagtgggcaatgcagcata 

taattcctcaggtcttagatatgactaccagtcctcattatttatatcgaatgactattctt 

agagcaatttcattgcttgcacctgtgatgggctctgagataacttgttccaagttgctgcc 

tgtggttattcatgctacaaaggatagagtgcccaacattaaatttaatgtggcaaaggtgt 

tgcaatcccttatacctattgttgaccactcggtggtggagaaaaccattcgccctagttta 

gtagagctagctgaagaccctgatgttgatgttcgcttttatgccaatcaagcacttcagtc 

aattgataacgtcatgatgtcaggctagagaatataactttggtgagagtactagaaatctc 

tcctcaaatcctctttgatagtcttgggattttgctctcacacgaagacacaagggaaaatg 

tgcaagcaaaatgcattctgttgagcttggagtcgtatattgttactaattcttttgtagga 

tttgacattcaagatgctgtgacactaatgaagaccgagtgtttttaaatgtaaagttgctt 

ctgcactatttagatctgctaagctcatgtatttgtttttgttagtgtacttttttggtgtt 

tgaacttcccacgttttctgcg 
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SEQIDN034 

tttgtacaaaaaagcaggctggtaccggtccggaattcccgggatatcgtcgacccacgcgt 
ccgagaaattagcagttagagacactgagaagcagcagctctcttcctcagctgctgtgtgc 
ttaggcaaagaataaaatgggggcagacaaagggaagaagcagaaagtggatgaggaaaaca 
acaatgttattgatgaaaagctcattttttccattgaaaaattgcaagagatacaagacgag 
ctcgagaagatcaatgaaaaagcaagcgacgaagtgttggaagtagaacagaagtacaacga 
gatccgcaagcctgtctacgataagcgaaatgatgtcattagctctatttctgacttctggt 
tgactgcttttttgagtcatcctgttcttggtaaccttctcactgaagaggaccaaaagatt 
ttcaaatttgtaagttctattgaagtggaagactcaaaggatgtgaaatcgggtcattcaat 
cacgtttaactttaagcccaatccttattttgaaaattcaaagctctcaaagacgtatacct 
tccttgaagatggacctacaaaaattacagctacaacaataaaatggaaagaaggcatgggc 
attcctaatggagttgctgacaagaagaaaggaaacaagcggtcccacgctgaagaaagttt 
ctttacatggttcagtgaagtcaatcaaaaaggtgatgtggatgatgacgaaaatgagattc 
tggacattcaggatgatgaggttgctgaaataatcaaggatgacttgtggcctaaccctctc 
aattattttgaccatgagcctgatgaagaagatattgagggcgatgagggaaaggacagcgg 
aggctctgaagaggaagaagaagaggaagatgatgaagatgaagaagacgaatgaactgttg 
gtagaccttgtgtttgatttgagttctcatcagtgtttcaatcatcagagttggtctctgta 
aagaggtttcggatattgcagaaaaattgaatgacatatagtggtgactctaatttttagtt 

tcagtga 
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ccacgcgtccgatcttgaaaaagttccattcttttttctccttctgcttcttcttctgattg 
aagattctgaacctgttctaagtttatggataggtggaatacttcactgagtggatattaca 
actacccttttcgattcttgcatttttattccatttttgttgtgattgtgttttcttccatt 
ttccctattatatcagctgggaggaggtcagatggggttattgt'aactcaagctgattttca 
agcacttaaggccattaaacatgaactgattgattttagaggaatcttgacaagttggaatg 
atagtggtttaggagcttgtgctggtggatggataggtataaagtgtgttaatggggaagtt 
atagctatacagttgccttggaagggattaggtggcacaatctctgaaaaaattggtcaatt 
acaagctcttagaaagcttagtattcatgacaatgttattgctggtcttgttccaacttcgt 
tgagtttccttccaaatcttagaggtgtttatcttttcaataaccggctttcgggttcaatc 
ccaccaaccattggcagatctccacttcttcagactcttgatcttagcaacaatcagctcac 
tggtactatccctcctagtcttgcaaattccacaaggttatacagactcaacttgagctaca 
atgcactttcaggttcaatcccagtaagttttactcaatccccttctcttacttttcttgca 
cttgaacataacaatctttctggctctattcctgatacttggggtaatgttgttgtgaacga 
taagtcttatcaacttcagtatcttacccttgatcacaatcttttatatgggaaaattccag 
cttcaattagcaagttaagtatgcttgaggagattaatcttagtcataaccaaattaatggg 
actattcctgatgaattaagtgcacttacaaggcttgctcttcttgatttatctaataattc 
cataaatggaactattcctgttagtttctccaatctttcagctcttgttactttgaatttaa 
agagcaatcttttggataaccaaatcccagatgtcatatatagattgcaaaatctttcagtg 
ttggatttgagtaagaataagctcactggccatattcctgccaccattgggaatatttctag 
get caact cacti tgattt at ctgaaaacaatttcactggtgaaatcccaaactctcttgttt 
ctttggcaaatttgacttctttggatgtctcttataacaatctttctggggttgtcccatct 
cttctttctaagaagttcaattcaagtgcttttgttggaaatctagagctatgtggatatag 
tccctcaactctatgtgcttcaccacctcctcaaactcttcctccttctcctattggtgggg 
ttgccaagcctcgccatcgcaaacttagtactaaggatatcattctcatagcatctggagca 
cttctagttgttctacttcttttgtgttgcatgctactttgctgcttgattaggaaaaaggc 
aaattcgaaagcaaaaaatggtagtaaagccagtggcttagctaccacagggagaggtgcaa 
agccggttccagcagcagcagcaggtgctgaagttgaatcaacaggtggaaagctagtccat 
ttcgatggaccattcgtgttcacagcggacgacttgttatgtgccactgcagagataatggg 
aaagaacacttatggaacagcatataaggctacattagaggatggtaatcaagttgctgtga 
agaggctgcgcgagaagatcacaaaagggcaaaaagagtttgaagctgaagttgctgaatta 
ggcaagattcgacacccaaatattttggctcttagagcctattacttgggacctaaaggaga 
gaagcttcttgtctatgactacatgcctaatggaagtctctcatccttcctccatgctcgag 
gtcctgagacaacaatagactggcctacaaggatgaggattgctattggcataacaaaaggc 
atctgctttttgcataccaaagaaaacataatacatgggaatcttacatcaagcaacatact 
acttgatgagctaaacaacccaaagattgcagatgtaggcctttctaggcttatgacaagtg 
ctggaaacaccaatgtgattgccactgcaggcacactaggttatcgtgcaccagagctttca 
aagatcaagaatgcaagtactaagaccgatgtctatagtgttggagttatcattttggagct 
cttaactggaaagtcacctagcgaggcaacagatggactcgatttgccacagtgggttgctt 
ccattgtaaaagaggagtggactaatgaagtgtttgatgtggaacttatgagggatgcccct 
aatattggtgatgaattgcttaatactttgaaactagctttgcattgtgttgatccaacacc 
aactgctcggcctgaagctgagcaagtacttcagaaattggaggagattaaaccagagctga 
tgttagcaccccccagttctggaaatgatggcgctgcagttcaagaaaaaaatgaataaact 
cagtaaggtttgattgctaaaagtgtattgaaaaaggtttaggagttccagcttttttactt 
gattgacacccacctatttattctttcatttttttttttgatccagtggagtgagttgttgt 
ctcctattagttctattagtaaactgtatatccgagcttctgattgctgcatagatgcaaaa 
cgcattttgttcaattccctctattctttgcaatgtaatgcaataatagtatctatcttttt 
gatgacatcaacacacgccacgtg 
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ccacgcgtccgccgtgatgtaatcttggtgatgctacttattcccttttcccttcttgagcc 
caaactcaagaaggtcaaaaacaaaaaaattacaaaaagctggaatcttgcagtttttttat 
ttaatttatttatcctatgttgaattaatttttggggtcaatatttcccaatttgtagtctc 
caatggagcctcgtgttggtaataagttccggcttggccggaaa'atcggtagcggttctttt 
ggagagatctatctcggcgctaatgttcaaactaacgaagaggttgcaattaagctggaaaa 
tgtgaaaacaaagcatcctcaactattatacgaagcaaagttgtataaaatactacaaggag 
gaactggaatccccaatttaaaatggtttggagttgaaggagattataatgcccttgtgatg 
gatttgctggggcctagtcttgaagatctcttcaacttctgcagtaggaagctgtctttaaa 
gaccgttctcatgctcgcagatcagatgattaatcgggttgaatttgttcatgccaaatctt 
ttcttcatcgagatataaaacctgacaactttcttatgggattaggaagacgtgcaaatcag 
gtctatatgattgattttgggctggccaagaagtatagagactcatcaactcatcagcatat 
tccgtatagagaaaacaaaaatttgacaggaactgctagatacgcaagcatgaatactcatc 
ttggcattgaacaaagtcgaagggatgatttggaatcgctgggttatgttttaatgtacttc 
ttaagaggaagtctcccttggcaggggctgaaagcaggcactaagaaacagaagtatgagaa 
gatcagtgagaagaaagtatcaacatcaatagagaccttgtgtaggggatatcctgcagagt 
ttgcatcatattttcattactgtcgatcactaagatttgatgataaaccagattatgcttat 
ctgaagagaattttccgtgatcttttcattcgtgaagggtttcaatttgattatatatttga 
ctggaccattttgaaatatcagcaatcacagcttgccaatcctccatctcgtgctcttggtg 
gtactgctgggccaagctcagggatgcctcatgctcttgttaatgttgagaggcaatcaggt 
ggagatgaaggtcgaccaactggttggtcttcatcaaatcttacacgtaataagagcacggg 
gctgcatttcaattctggaagcttattgaagcaaaaaggcacagttgctaatgatttatcca 
tgggtaaagagttatccagttctaattttttccggtcaagtggaccattgaggcgtccagtt 
gtctctagcatccgagacccagtgattgcagggggtgaacctgacccctccggcactctgac 
aaaagatgcaagcccgggaccattgcgtaaagtatccagtgctgcacggaggagttcaccag 
ttgtgtcctcagatcacaagcgcagctcctctatcaaaaatgccaacataaagaatttagag 
tccaccgtcaagggaatagagggtttaagttttcgatgatgagggactgcattagtagctgt 
gctttgtctcagttctccgttcactgtaaattttggcacaccaacttggggagtaagagttc 
tgatattagttgctgtcaggaagtaccataaagctgaattatacaattaaaatttgggatcc 
aatcgcaaaagcacattaaggatatgatggggttgcagatccaaactcacagattccagttt 
atgctcgtccatacagttataggcactttccatattcttttctttaatctctgtctcttgct 
tgttattgttatgtcgtggtattcttgttgaggtcatgtttgtgaattgcgaagatggtcat 
gtataattgccgagaaatcatgtactagtttgttttaaacatgagcaaactgttattttgtt 
caagctactttaatatcaaaaaaaaaaaaaaaag 

SEQIDN037 

Ccacgcgtccgcccacgcgtccggaagaagaagcctgctgccatggcttccgagaaagaagc 
tgctcttctcaccgttccttcagattctcctaccttatttgacaagatcattaacaaggaaa 
tcccagcaaacattgtctacgaggatgacaaggttttagctttcagagacataaatccccaa 
gctccggtgcacattctgcttattcccaaggtcagggatggcttgactggactgtccaaggc 
tgaagaaaagcattgtgaaattcttggtcaacttctttacaccgcaaagcttgttgctaaac 
aagaaggtctgctcgagaatgggttcagacttgtgatcaatgatgggcctagtggatgccaa 
tctgtttatcatcttcaccttcaccttctcgggggacgacagatgaactggccacccggcta 
aaggaagccgagatgaattccagatctcatggagtatccagacttcatccgatcatctatgt 
gtagcacttactgaaaacactatcgtctatgtgtagcgtttgaagaatcaagctctaagctc 
gtcctatgctcctatggagtgacaaataggactcattccgactattatattgatcatcaata 
agagggatttctctgaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 
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SEQIDN038 

ccacgcgtccgcaatattttgactaatacactgttctgttcttcacctaattcttttcttct 
tctaataataacgtgctgctaagtcctaaagctcctctttggagctccaattaactccaaac 
taccaaaatccagagcaggtttaacaaagaatgggggttggaggacgtgaagtggcgatttc 
attggatggagtgagggacaagaatatgatgcaattgaagaaaatcaatactgcaattttcc 
cagttcgctacaacgataagtattacactgatgccattgcctctggtgatttcaccaagcta 
gcatattacagtgatatttgtgtaggttcaattgcatgtcgccttgagaagaaggaagctgg 
ggctgttcgtgtttacataatgactctgggtgttttggctccatatcgtgggctaggtattg 
gcaaaatgttgttgaaccatatcctcaatctttctgccaagcagaacgttagcgagatttat 
ttgcatgtgcacacaattaacgaagatgccctcaatttctataagaaatttggatttgaggt 
tactgataaaatccaaaattattatacaaacataaccccaccagactgttttgtcctgacca 
agttcatcactcaaacgaagaaatagatagtctcagctactttgattgagccttggtcaaac 
cttcacattatctctqaggttctgagctttctggttctagtttttgctacttatgagtaatg 
tgccacccattggattgttagtgtaagcccttttctgttctatcttatcctatctgcaacaa 
catcaaagttgaatgatttccctgtaatagaaatagtgcagttcaatgcaaacattcgagtt 
tggttatgttagatcacg 

SEQIDN039 

gatatcgtcgacccacgcgtccgaaagagaagaaatattaaatagcacaagaaaaatggaga 
gtgctaatgcatattctacattgccaatggaaaatgttaacgatgttgggcttattaatttc 
atggacgaggctaactttgaacaatttattgagctcattaagggtgaaactgctgaccctat 
cgtgaagttttgccccaactatgactgtgaacacattacaggttgttttccttcgactgatg 
tccaatttgagccaacaccaatggatatctttgattggaatgctacaaacatatctaatcct 
atttcacttttttcttccctccccggagaaatgaagctccgggaagaagaagaggaggagga 
agacgacaatgattacgaggaatcttctgggacaacaactaccaccaccatgactatgttgc 
cggcaacgccaacaaagaagagcacgaggactgaccgatcaagaactttaatttccgagcga 
aaaagaagaggaagaatgaaagagaagctttatgccttgcgttccttagttcctaatataac 
aaagatggataaagcctccattataggagatgcaatactatatgtacaaggactgcaaacgc 
aagcaaagttactaaaggcagaaatagcaggtcttgagtcttcctcaaatgaaatgaacaat 
aatccatttcagaataccaagcaaatgaaattgatgactcattatcctgcaatcaagaggat 
atcgaagatggacatttttcaagtaggagaaagaagcttttacgtgagattagtatgcaaca 
aagggcgacaagttgctggttctcttttcaaagctcttgagtctctttctggattcaatgtt 
caaagctccaacttggctacttctgccgatgattatattttgacgttcactcttaatgtgag 
cgaatgtgaggtagacatgaacttggccaatttgaagctatggatagctagtgcttttctta 
atcaagggtttgacttcgagacattaccattggcctaacgtttcattattgtaattgtgcag 
agttttaaccggtcaaagaatgagaaatgtcattatttatcggtcgtcatttgtaacttttg 
attatttagagtcacgtattctaaaagagtaaagtttgtcaaattgcaatggcgcgcatcgc 
actgtgtacatgtgaccgacctaattgtttattacggttgactttgttactactacttttgg 
aatcaaaacagtcatggcgggcgcg 
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ccacgcgtccgctttccacattctctcaactttctctttctaaaactcttcctctttttcta 
gcacacagaccttcaatggcatcgccgcgcgaggagaacgtgtacctggcgaagcttgctga 
gcaagccgaacgctacgaggagatggtagagttcatggagaaagtcgtcggcgccggcgacg 
acgaactcaccgtcgaggaacgcaacctcctctccgtcgcgtac'aaaaatgtgatcggagcg 
aggcgagcgtcgtggcgcataatctcatcgatcgagcagaaagaagagagtcgcggtaacga 
agatcacgtggcctccattaaaacctacagatctaagatcgaatctgaattgacttcgatct 
gcaatggtatccttaagttgctcgattcaaaactcattggcaccgctgctaccggtgactct 
aaggttttctatttgaaaatgaagggagattattacaggtacttggctgagttcaaaaccgg 
agctgagagaaaagaagccgccgagaatactctttcggcttacaagtcggctcaggatattg 
ctaatgttgaattagcccctacacatccaatccgattggggctagctctcaatttctcagtg 
ttttactatgagatattgaactctcctgaccgtgcttgtaatctcgccaaacaggcatttga 
tgaggctattgcggagcttgacaccctgggagaggagtcctacaaggatagcactttgatta 
tgcaacttcttcgtgataacctcactttgtggacctcggatatgcaggatgatggaactgat 
gagatcaaagaaccatcaaaagcggaggagcagcagtaatgtgagtgaagcctctttgctta 
ggattgaatcctatggcataactttgctcattgatcgaaatttgctgtttgtgtagttctga 
attccctgaattgtaatacctaaaagcactgtttcttgccatttgttgttttcagcaaagat 
tactttttctctcggtatttcccttgtatttggatgctccagtgaaactctcttatttcgtg 
gaaatgaatgcttg 

SEQIDN041 

ccacgcgtccgcccacgcgtccgctccatgtttcatttactttggagttggttgctaaaaca 
gattaaagctagctgctaagctagtactgttagagttttgttaattagaagaaactaaagag 
tcaaaaacagtggatccaaggcatggaaagaggggacttttcatccaatgaaatggaaatgg 
aagagaaagagaataacgataatattgatgatcctcaacttcaagaggagctctataatata 
tactcagctcgatctcagcatgacatgtctgctatggtttctgtcctttctcaagttattgg 
aaacagtaccacccattcttcttctgctaatgctactccattaaccctacctcaatctgctg 
tagctctccaaaaccaatctcaatctattgaggatcaagggaattcgagaagaaaaaggtat 
agaggagtgaggcaaagaccatggggaaagtgggcagccgaaatccgagacccaaagaaagc 
agctagagtatggcttggcacttttgaaactgctgaggctgcagcaattgcctacgatgaag 
cagctctcagattcaaaggcaacaaagccaaactcaacttccctgaaagagttcaaggcaaa 
ttccaataccttactactactactagtcaaaatcatcacttgcctgataatattgttcaaca 
acaatatattccaactagctccaataataatcatcctctcccttgtcaagaacattatccta 
gtttacatcactatgctcagctacttcagagtgacagcaatattactgatttaaacttcggt 
atctcgccaagttataatcagcagttatctgcttcttttgattttgcgcaatcatcatctaa 
cagtacattatcggaattgccagcttcttatgagcagaggcaattacaatcaagttacaagc 
aagaagaagaagttttaatgagattttcatcgcattttggtactacttcaagctcatctgga 
cctcatgaaagtaactgggaagagtttgaagatagaaagtcataagttcattccctagtatt 
aagagatacgaagactgaaagaagttttatgagatttccgtcgcattttatttcgtagttta 
tggttttactggggttttctgtcctctgatcttgtatttcagttaagtgtaatagtagaact 
atatatattcatgaattaatggaaaaatattggtgtggttttatgtgtttaaaaaaaaaaa 
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SEQIDN042 

ccacgcgtccgcccacgcgtccgcccacgcgtccgatttgcttttccctcctcttctcctcc 
attttcctttaatgtcactaaaacagaagaaggggggaacagaaaagagttagaaaatgata 
ggagggaataatagttttgggaagacaatttgctcaatctgttatgaagatctaaatcctat 
tattgaagacctccaatccatttctatttgcggtcacgtttttcacgagatttgtcttcagc 
aatggtttgaatactgtacaaatggaaaaaagaagaattgtccagtttgcaaacaggcttgt 
tcagaacaaaatgcaaataggctttatttccaatcagttggtgatccaaatgatacaagtct 
gaccaagaaaccccgtgatcatgaagaggatccacgtgaactgagaaatgaggtcaaaagat 
tggaggggaaagttttacagttgacttctactttggagaaacagctgaaagatctcaaagaa 
gtcaatgcagagcttttcacatgcaaggaagagttgaaaatagaagcgactctaaagaatga 
agctgtgaaacaggaggcagccattcagcagttgttacatcttaaatccaaggagct agate 
gatcaactttggagtgcataaggctaaaagatggaaatatggctctagatagggagcttgca 
gcactcaagttgagttacaaagaactagtgaccaagtgcgatactcatggaaggcgagaggc 
tcgttctcttaggaaacttgagaagtcaaaagaaaagataaataagttgaagaccagggtcc 
aagaacttgagacggcacttgaaagaaaagaaaaagataatgaaaatttgagaactttgaga 
gctgccaagaaaaactttgagttgtatcaaggaagcaaagaacccaaagttgaccgacgttc 
atatgagaatcagaataaggcacctgctgcgacagaagtagatttatgcatagtcactggct 
catgcaatgatttatctagaccaaggagaaaaagaaagtctaagtctaaggaaaagagtata 
caaaacacggcagaagatattataactggtggaagtcaagtgcagggatcagaaaataagga 
tggaatctcaggttcaaggaattcccctgttattattcttgacgatgatactgatcttccgc 
ttctagatgatgttacacagcatcagccctcgtttcgcatcaggaaagagacttctgcacca 
gttatacttgcccatccaggagatacctgtttttctggtggattattaggtcctgatggtac 
ttactggcacttgggaaaatggtgcaagaaggttaaggacaagggatctggatcactgtctt 
gaggactgcaaggatcaggtgtgactgctgttgatttgattgctgtaggagctgacggtaga 
ggcggttggatcaaagttctgcgatcaatgaatccgggatcattgcaggacaaaaataagag 
tgtcatcagtcaagagatacaagtatgacatgaaatcaagtagttcccagtctcaaggatgc 
ttgcatacagatagcttcttcagaagaaccagtggataaccttattaacagtgctgcttcta 
cattaccaactgtagataatagagattaaattccttatacattgtttaggggtttaattttt 
agcaatctagttatactaccatttgattgaatggtccgaaaagaagaaacttaatgtcttct 
tttgagcatgtaaagtagggattcaaaggggaaaggtagcatacaggggagagaaaagcaag 
aaagcaacttcagcaattgtttcttagcggttttcagttatgttgcttgctcaatccatatt 
gaaagtatactcttggtaaaaaaaaaaaaaaag 
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SEQIDN04 3 

ccacgcgtccgatcaaatcttggcaactatggcttccttattatcggtgccattgtctctat 
catcctcatcaatgcaattaagactatcatcaaggatagactccattacttcaacaaccaag 
ctagtgaaggaggagagccaagtagagctcctcagaaagttttaccaattgagatgtccttg 
gatgagctaaatatactaactgataactttgatgagaaagctctgattggaaagggatctta 
tggctgcgtttttggtgctaaattaagcaaitgaccaacaagtagcaataaagaaattggata 
ctagttcttcaccagaaccagattccgactttgcagatcagttagcaatggtttcaagactt 
aagcatgagcattttgtgactctaatgggttattgcgtggaagcaaacaatcgaatcttggt 
ttatgagtttgcaaccatgggcacgttgcatgatgtattacatggtagaaagggagtacaag 
gtgctaagcctggtctacttcttacctggaatcagagagttaaaattgcttgtggtgtagct 
agtggcctcgaatatctacatgaaaaagttgaacctccaattattcatcacgatgtt agate 
tagcaatgtactactctttgatgatttcacagcaaagattgctgatttcaacttgacaacct 
ctgaatcttcagaqgacttrggctaccatggtccagagtatgccatggaagaagagataaca 
aagaaaagtgatgtttatagttttggagttattctattggagctgttgacaggaaggaagcc 
aatattagattataaaqggcaacagagtcttgttgcatgggcaactcccctattaagtgaag 
ataaagtgaaggagtgtgttgatcccaacctaaataatgactaccctccaaaggcaattgcc 
aaggtggctgctcttgtagaactttgtgttcaatatgaggcagatttccggccaaacatgtc 
aattatggtcagggcactgaagccacttctcaatgcaaattaaactagaacctcaagcatac 
aacattattcttaattgaaacaagctgcagggacagttttaagtaccaacagggctgtgacc 
caagtcatggttccgttgccaatccaaggaaaaaggaaatcttgctaaaggactattggaat 
gttgttttcttaactgatttgtttaagattaaatatattatgttctcattttaaaaaaaaaa 

SEQIDN04 4 

Ccacgcgtccggaatcgacaatgaaattcagccgcgcattcaatgccgcttcagttctctta 
gttcttcttcttattaccatagttacggctaagaagtccggtgatgttacggaattgcagat 
cggtgtgaagtttaagccaaaatcttgcgaacttaaggctcacaagggtgatagagtctcag 
tacactacagtggaaaacttacagatggaactgtatttgactccagctatgagaggaatgac 
cccattgagtttgagcttggaagtggtcaagtgattaaaggttgggatcaaggacttcttgg 
aatgtgcgtgggagagaagcgaaagttgaagatccctgctaaacttggttatggcgagagtg 
gatctccaccaaagattccaggtggtgctacacttgtcttcgacactgagctggttgctgtg 
aatggaaagagatcagcagctgatagtgaactgtgatttaacgatgtctctacactcttcat 
tagcgacttctaaatctatttttaggttatcttatagttatatttgcttctttttgataatt 
tagatactaaagtattggctgctggcaaaatgacacctcaagtgtgtttccttttgtcacta 
gttttttcctctgctaaagttaagtggatggacgatgaactcccaagatggttttgccatca 
ttacttttaaaaaaaaaaaaaaag 
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ccacgcgtccgcgaaccaaaacttccagaacccaatatttacaacctgtcttccgccaccat 

tgacaagaaccttaaaaagcttgcttccaagtaacaataagagaaggttgattggaaaatac 

atttgattatttttattagagaatggcggggactaaggaacagataatggatgttcggtctg 

tggtggaagcagtaacagccgccggagatgatgttgagattgatactcctctttatgttgtt 

gaaagcctctgtatgcgctgtggtgaaaatggtacaacaaggtttctcttaacactgattcc 

acattttagaaagatattattgtcagcatttgactgcccacattgtagtgaaagaaataatg 

aagtggagttcgctggtgaaattcagcctcgaggatgttgctatggcttgcatattccatca 

gqtgatcaaaagatgctcgaccgaacggttgtcaaatctgaaagtgctaccatcaagatccc 

tgaactggattttgagatcccccccgaggctcagcgtggatcattgtcaacggtagaaggca 

tactggttcgagctgctgacggtttgggggcccttcaagatgaacggaagaaagtggatccc 

cagatggctgaagcaatagatcggttcttgataaaactgagagcttgtgcttcaggagattc 

atcctttactttcattcttgatgatcctgctggtaacagctttattgagaacccgttagctc 

catctcctgatccctcattgaaaatcacattctatgagcgaactcctgagcaacaggcagct 

ttagggtatcttgccgacccatcacagcttggaggacaaagtgatgaggtatcaagtgaggg 

tataaataatgttcctcatcacctgctaaaggaaccacatggatcagttggagcaagagcag 

gacgtcaggctattgctcagggtaacagtgcagaaatagctgaagctctatttcgatattca 

gctcctgaaqaggtgatgatgtttccatcaacttgtggagcatgcgctgcgaggtgtgactg 

tagaatgtttgttaccaatattccatactttcaagaagtaatagttatggcatcctcttgtg 

atgcttgtggttatcgcaactctgagctgaaacctggtggtcctatatctgataagggaaag 

aaaattacccttcatgtggaaaacattgaagacttaagccgtgatgtgattaagtctgatag 

tgctggagtggaaattcctgagcttgagttagagcttgctagtggcactttgggtggaatgg 

tgacgacagttgagggtttaatcacaaaaattaacgaaagtcttgagagagtacatggattc 

acatttggagacagtcttgatgaagacaggaagagcaagtggttggacttccgagcaagact 

agacaagcttttgagcttgggacaaccgtggacattgatcatcgacgatgcactttcaaatt 

cttttgttgcacctgcaaccgatgatatcaaggatgacaaacagttaacatttgaagattac 

gtaagatcgtgggagcaaaatgaggagctgggtcttaatgacatggacaccacctcagctga 

tgctgcttacagttcagcagatgctgcacccagtgagaaagctgacgattgatgaatttatg 

cttagtgattttctttcatactgctttggccttaaaatctaaggtaagcgttgattgttctt 

tcatatgactgtagaagagatctagaaccataaaagattgccaacgcctgcagccatgtcta 

catagtggccttgtgactagaactcctttaaatagagagacaaacattttaattagctatac 

gggttcctttaatcaaacacttcagagttattaacaatgcaatttgttttagaagatagttt 

gcaatgcaatgatttttgacttgtaaaaaatatcaatc 
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ccacgcgtccggtggatttgtgtgcagcaccaggcagctggagtcaggttttaagccggaag 
ttatatctcccagcaaagttgtcatctgataccaaggacggcgatctaccacttattgtggc 
tattgacttacagcctatggctcccattgaaggtgttattcaagtacagggtgatataacaa 
atgctaaaacagctgaagtggttattagacattttgacggatgcaaggctgacattgttgtc 
tgtgatggtgcacctgatgtaacgggacttcatgacatggatgaatttgttcagtcccagct 
gatattggcgggcctaaccattgtcactcacatactaaaaggaggcggaaagttcatagcaa 
aaatttttcgaggaaaagacacaagccttctttactgtcagctaaaactatttttcacagaa 
gtgacttttgctaagccaaaaagcagtcggaattctagcatagaggcatttgcagtttgcga 
gaattactctccacctgaaggatttaatgagaaagatcttcatcgccttcttgaaaagattg 
gaagtccatctggcacagaggacctagattgcagtagtgcatggctggaaggtcctaataag 
gtgtatattccatttctggcttgtggagaccttagcgggtacgattcagaccgtt.catatcc 
acttcctaaagctgcagatggaacctatcagtgtttagatcctgtacaacctccaattgcac 
cgccatataaacgagctcttgaaatgaagaaagcgtcgaatcaaggaatccaaaacctagac 
aagctttctcttagctcctaatcttaccatccagaattattccattctgtgacattggaaag 
tcgcttatacgtcaccaaatgtaaggactttttattgttaactgcacttgcaattaatgaat 
ttaatgtgttttgttaaaaaaaaaaaaaaagg 

SEQIDN04 7 

ccacgcgtccgtgcagaaatggcgactcgttatctgactcgttccttattcactgctctctc 
tcgctcatacacttctctttctctctctacacctcctccctctctctcttctttttctctcc 
tccgtcttcgaccgcttattgccgtcacagctgtcaacctccgcagcgtctctccggcggta 
gcaacctcctttcgagggtttgcgactcggcaaacgtcgtcgtctttaaatgacccgaaccc 
gaactggtccaaccgtcccccgaaggagacgatcctacttgatggatgtgattttgagcact 
ggcttgttgtgatggagaaacctgagggtgatcctaccagagatgaaatcatcgatagctac 
atcaaaactctggctaaagttgttggaagcgaggaagaagcaagaatgaagatatactctgt 
ctcgacaagacattactatgcatttggagctcttgtatccgaagaacttcattacaagctaa 
aagaactgcccagggttagctgggtgcttcctgattccttcctggatgttaagaataaagat 
tacggaggggaaccttttatcaatgggcaggctgtaccatacgacccaaagtaccacgagga 
gtgggtaagaaacaatgcccgagctaatgagaggaacaggcgcaatgaccgacctcgtaact 
ttgataggtccagaaattttgagagaagaagagagatgcagaacactggatccaacatgggt 
ggtggacctcccaatatgacgaatgcgccaaccccaaacatgggtgggatgcagcagcctaa 
catgggtgggaggcatcagcctagcatgggaggaccacagcagcccaatatgggaggtgcac 
ctcataactacggtggagcgcctcccaataactatggtggagcgccaaacaatccaaataat 
tttcaatacaatagtggacaaagcaacggaggcatgccttaccaaacaggtccaggaccaaa 
ccagaattatgcttcaaatacatctggtggaaacccttatcagaatccaaacatgcctggaa 
gagatatgccccctccaaatcagaactatgctccgaatacggctggtggaaacccttatcag 
aatcaaaacatgcctggaagagatatgccccctcgaaattatcaataggctgatatagataa 
gtatgaactttgtatttccagagttctgtttcacgaaatgagaacatagctatggtgtgctt 
gataggatgttgctgcgtgtaatagttgaatgtgcaaaacttatatgctttgtgagtatgca 
atgtcaaggtgttctcatcctattgcatcctctatgttgacatgctctctgtcaattctcct 
gatgagtttactagcctgaccaagaatatgttatgctttaccatgttgaatgcttgaaattt 
cagggcctcattgcaggtactgttcaaaaaaaaaaaaagg 
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ccacgcgtccggaaatggcgactcggtatctgactcgttccttattcactgctctctctcgc 
ccatacacttgtctttctctctctacacctcctccggtctatcttttctttctctctcctcc 
gtcttcgaccacttatcgccggcgccgctgtcaacctccgcagcgtctctccggcgggagca 
acctcctttcgagggtttgcgactcggcaaacgtcggcgtcgtt'aaatgacccgaacccgaa 
ctggtccaatcggcccccgaaggagacgaccctacttgatggatgtgattttgagcactggc 
ttgttgtgatggagaaacctgagggcgatcctaccagagatgaaatcatcgatagctacatc 
aaaactctggctaaagttgttggaagcgaggaagaagcaagaatgaagatatactctgtctc 
gacaaggcattactatgcatttggagctcttgtatccgaagaactttcttacaagctaaaag 
aagtgccgaaggttagctgggtgcttcctgattcctacctggatgttaagaataaagattat 
ggaggggagccttttatcaatgggcaggctgtaccatacgacccaaagtaccatgaggagtg 
ggtaagaaacaatgcccgagctaatgagaagaacaggcgcaatgaccgacctcgtaactttg 
ataggtccagaaattttgagagaagaagagagatgcagaacactggacccaacatgggtggt 
ggacctcccaacatgacgaatgcgccacccccaaacatgggtgggatgcagcagcctaacat 
gggcgggaggcatcagcctagcatgggaggaccacagcagcccaacatgggaggtgcacctc 
ataactatggtggagcgcctcccaataactatggtgggtcgcctcccaataactacggtgga 
gcacctcccaattactatggtggagcgccaaacaatccaaccaattttcaatacaatggtgg 
accaaccaacggaggcatgccttaccaaacaagtccagggccaaatcagaattatgcttcaa 
atacatctggtggaaacccttatcagaatcaaagcatgcctggaagagatgtgccccctcca 
aatcagaactatgctccgaatacggctgacagaaccccttatcagaaccaaaacatgcctgg 
aagagatatgccccctcaaaattatcaataggccattgtatatgagtatgaactttgtattt 
ccagaattctatttcacgaaatagtaacagttgtagccgtc 

SEQIDN04 9 

ccacgcgtccgcaaaaccctaaactcttcaccttcaaacatcaaaatcctctcgcattctct 
ctagtaatggctaccgctaactcctcttctttctcacctgtatcttccccttcaaaccatgt 
tcccctaaagcgagtaggtactcacaatggtagcttccattgagatgaagctcttggttgct 
tcatgattcgtcttacaaacaagttttacaatgctcagattgtccgtactcgcgatacccag 
gtgttggaaacgcttgatgcgggtgcttgatgttggtggggtttatgatcctagtcgagacc 
gttatgatcatcaccaaaagggatttcaagaggtttttggacatggtttcactactaagctt 
agcagtgctggtcttgtttacaagcattttggaaaggagataattgcaaaggagctccaagt 
tgatgaagaacatccggatgttcataggttgttccttgccatttacaagagcttcatggagg 
caattgatgcagtcgacaatggaatcaatcagtacgatacagaccagtcacccagatatgta 
aataatactcatttgtcctcacgagttggaagactaaacttggactggattgaacctgatca 
gtcttctgaaaaggagaatgaagctttcgaacgtgcaatggatttagctggcagtgagttct 
tggatcgcgtccgctt teat gtaagatcttggttaccagcacgctcaat cat cat ggagtgc 
cttgctgcaagacacaagattgatcctagtggagagattgtagtttttactacattttgccc 
gtggaagcttcatttgtttgagctggaagaggagatgaagattgatcctcccatcaaatatg 
ctttatatcaggatgataggagcaaaagttggcgagtgcaagctgtgggtgtagctcctgac 
agatttgagagcaggaaagcccttccagctcagtggcgaggtttaagagatgatgaactctc 
caaggaaacaggaattcctggctgtgttttta'tccacatgagtgggtttattggaggaaatc 
aaagttatgaaggagcactcgcaatggcaaaagctgctttgaagctctaggcacaggaacag 
ttttataaatggatttcagaaactgagtgatctctttatgatttaacattatagctgatcat 
gacatcaggttgccatttaaatagcgcattggagttgaatttattcaaggttattaaggaaa 
ctatacacaaccaggcagacagttttttacatattcagatgctatcttttacttttac 
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agcggctggtaccggtccggaattcccgggatatcgtcgacccacgcgtccgtcttcttctt 
cttcttcttcttcttcttcttcttcttcttcttcttcttcttcttcaatttttctctctctc 
tttttctctagggtatacagaaatggggatcgcaacggagaatcaaccacagcaacaacaca 
aggcgtcaccagaggcatcatcagaggcagataaaaagaggtggatgcttaatgattttgac 
attgggaagcctcttggaagaggaaagtttggtcatgtatatctagctagggaaaaaaggag 
caatcacgttgtcgcattaaaagtgctgttcaagagccagctaaagcagtcccaggtcgaac 
atcagcttcgtcgtgaagttgagatacaaagccaccttcgtcatccaaatattttgaggctt 
tatggttacttttatgaccagaaacgtgtgtatttgatcctggaatatgctgccaagggtga 
actctacaaggagctgcagaaatgcaaatattttagtgaacggcgtgctgcaacttatgttg 
cat cc ttagcccgagccctaatatactgtcatgggaagcacgtaatacacagagatatcaag 
ccggagaatcttttggttggtgcacagggtgaactcaaaattgcagactttgggtggtcagt 
gcatacctttaatcggaggcggactatgtgtggcactctagactatttgccaccagagatgg 
tggaaagtgtggagcatgacgcaagtgtggatatttggagcctgggtatcctctgctttgag 
tttctgtatgggatgcctccatttgaagcaaaggaacactcagacacatatcgaaggattgt 
gcaagtggatctcaaatttcctgccaaaccaattgtctcatcagctgccaaggaccttatta 
gtcagatgcttgtaaaggattcttctcagcgtctgcccctaaaaaaggtcctggagcatcct 
tggattgtgcagaatgcagatc.cttcaggtgtttataagggctgatgaagacatcaccaatg 
actcacaatctttgtggcggactaaattgtttttgtttttcactgaaaaagcctttgctcag 
cgtta 

SEQIDN051 

gtttgtacaaaaaagcaggctggtaccggtccggaattcccgggatatcgtcgacccacgcg 
tccgcgggaagattctcatgcaattaaccgaatcgtcaaattttcctctaaaatataaagtt 
tctccggaaaatgtcattcatcgatgaatttcaagccaatatagaagctcttccgaaccatt 
tacggaggaaatatgccttattgcgtgatttagataaaagtctgcaaggagtccagaggcaa 
aatgagcaacgttgtgagaaagaaatagaggatatgatacagcgtattaaggctggtaacgt 
gacaccagactcttcactaatcaaattctctgatgatgcattggatgagcaaaagcatgcaa 
tccgaattgctgatgagaaagttgcattagcttctcaggcatatgatctggtagacgctcac 
attcagcagctcgatcagtacttgaaaaaatttgatgaagagctccggagagaaagagatgt 
tgctgttgttactggaactcctgctaccactgttgaaaataatggaaagtccggaaggtctg 
gtgaaggtaagggagggcgcaagaaaacacgtcttgctacagcagcggcagctacagccact 
gcagcagcagcagcaacaccaagtggaatggatttggatctacctgttgatccaaatgaacc 
aacatattgtttctgcaatcaagttagctatggtgaaatggttgcgtgcgacaatcctaatt 
gcaaaatagagtggttccactacggctgtgttggccttaaagaacagccaaagggaaaatgg 
ttttgcgcggattgtgcaggaacacaaaagaagcggaaaggcagatgatagtagtagaagaa 
aataattcagtatactgatttaagacgttttaccaccggaaaaatttatgtagatactgtac 
ttctgtaattttgttatgtgtagccattattaacaagtcactcttgcattctaattgtagga 
gggaagtacaataagtcaacaaaaaatttactcttgtttattatgaactataacgaacaaat 
aaactattgtcttttaccaatcaacatatttgtaatc 
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Ccacgcgtccgcccacgcgtccgtttcatattcttcttcttctgcttcattgttattgttta 
tagaaaaaaatccaagaatggagcagttgcaagagggttttcgttttcgtcctacagattca 
gaacgacttatgtttttgttgagattcattgctaaacaagagatgaatgattctggatttat 
cacaacaaacatcgacgtctatggcagagaagaaccctgggaaatttacaatcacggcgtat 
cctgtggtaatgaagataatgcggactacagcagtaactatcgctatttcattacaaagctg 
aagaagaaaaacaaggcgaggcataatctagaggttggaaataaagggagttggaaacaaca 
agataagggtaaatcagttcactacaaaaatacgggaaattcatcttctgtggttattggat 
gcaaaaagagcttgtgttacgtgaataaacatcagtgctataatcagagcgatggacattgg 
ctaatgaaggagtacgagctttctaatgttattcttcagaaattcgacgaagattgtagaga 
ttatgttctttgtgccatcaaaaggaagtcatgttctactgattatattgagcggccattgg 
caagggtgcagtatcaagtgaatgatttgggggactatatgcagagcaattcagggcattat 
gtggaatctgaaacggacatgacgacacagaacgaggtgcccgaattagaagttcttgatta 
tcaattagaagttcttgggatgaaaaggacttagctgatttaaattggatgttatatgatat 
gcctgtggtcgatcagacggtgaatattgtcgagcagcagaggaaccagaggtcagttatta 
ataagagtgatgaattctatcagatgttggcacaaaacgaagcttttgagttctattgatta 
actgtatagtcatattcttggtagatgatagagatttgattaacaatggcatatgtcccact 
ttgtagaatggaatttaagatagtagtacatctatatatctttgtataacagtatggcgcgc 
gcc 
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ccacgcgtccgtctatctaagcaatttccgtagttccaaacacggtctaaatcagattcctt 

gcttttttcatctcaaattctctactttttgttgcttcgcaacttggcttagatccttcttc 

atcacgctttgtaactgcttcactcaagctatttcacgatgcgatcaatgcatagatagctg 

aaaattcgaagtgcccggaagaaatggagaaggagtcacatggattaattataggcatctca 

attggagttgtgataggagtgcttttagctatacttgcatttttctgctttaggtaccatag 

gaaacgtcctcagatagggaatagcagttctaggagggctgccactattcctattcgtgcaa 

atggtgctgatacttgtacagtattatcagactcttccattggtacagagtcaccaaaatcg 

actatccagaatggcatgtcagtgtggcttggcggccttaggaaggcaaatgttgtttctgc 

ttccggtatactcgagtactcctacaaggatttgcagagagcaacctacaacttcaccacat 

tgattggtcaaggggcctatggtcctgtttataaggctcagatgtctactggtgagacagtt 

gctgtcaaagtgctcgcaactgattctaaacaaggagagaaagaattccaaacagaggtcat 

gttactgggaaggctacatcatagaaacctggtgaatttggttggatattgtgcagagaagg 

gtcagcatatgcttatctacgtttacatgagcagaggcagtttggcttctcacttgtacgat 

gaaaagcttgaacccttgcactgggatttgagagttcaaattgctcttgatgtggctagggg 

cttagagtatcttcatgacggggcagttcctccagttgtacaccgggatattaaatcatcca 

atattttgttggatcagtcaatgagagctagggttgctgattttgggctttcaagggaagaa 

atgatcagtaaacatgtatccaacatccgtggaacattcggatatcttgatcctgaatatat 

atcaactaggtcattcactaagaaaagcgatgtttacagctttggggtcttactgtttgaac 

ttattgctggtagaaatcctcttcaggggctcatcgagtatgttgaactagcagccatgact 

acagatggaaaaggtggatgggaggaaattgcagattcccgtcttgatgggaagtatgattt 

gcaagagcttaatgatgtagctgcacttgcatacaaatgtgtgaatcgtgcccccaagaaac 

ggccttccatgagggacattgtgcaggttctgtcaaggatacttaaatctagacccgacaga 

aagcgtcccaagcgtttttcatctgcaacagcagaggaggttaccatcaatgctgaacaacc 

agattatcggagtccaaactctggaccccgacgaggggaatctatggacagcccagctgact 

catgtgaagtttaacccagttcttccatttgtttatttttttttttttaatttcttcctctt 

cttttcttcttgtaaaattggtcaggttgttaggttctccattcataaca'cacttctgtctt 

ggtgcgttcgattggggtacttaggatctgttatagtctgcgtgtaagatagcctttctttc 

tttccaattttgttaaatttttgtaaatttgcgtggaaggtaaccgaatggcagaaggaaag 

ggtgaaaagcccagatcagccttttgtcaattctatgaaagttcatatatctttccacaaaa 

gtgcacgg 
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ccacgcgtccgccagcaagcacagtcgtccacaattagatattgatctgaatgtaccagatg 

aaagaacttttgatgaaataaattctcgagattctgctctagagttgatctctccattggac 

catatgactaatcgtgctgcactgaagaatgaagtaattgattctcctgctgttcgctgttc 

tggaggactcgatcttgatttaaatagagttgatgaacctggtgatgtagggcagtgctctg 

tgagtagcagtagtagattggatggtgtagttttaccttccaaaacatcatcatccattggc 

ttgccaactggggaggtgaggagggactttgatttaaataatgggcctggtgttgatgattc 

cagcgcagaacagtttttattccacgataatcatcagggaagcatgcgttcccaactgcctg 

cttctagcctcagactgaacaatccagaaatggggaatctttcttcttggtttactcctggg 

aatacttattcaactgtgacacttccatcaattttgcctgatcgtgtggagcagccgccatt 

cccaatggtcacacctggtgcacaacgaatattgggtcctcctgctgctggttctcctttca 

ccgcggatgtttaccggagttcagtattgtcatcatcgcctgccgtgccttacccatcct cc 

ccttttcagtatcctatatttcctttcggaacgagcttcccacttccttctgcaacattttc 

agttggatcagcttctttcgtagattcttcctctggtgggcggctttatacgccccctgtaa 

attcacagttgctgggtcccgttggcgctgtgtcatctcaatatccaaggccttatatggtt 

ggacttcctgacagtagcagcaacggtaccatggatcacaatagaaaatggggaaggcaggg 

tctggatcttaatgcaggccctggagtggtggacatggaagggagagaagagtcggtttctt 

tgtcggcaaggcaactctctgttgccggttcacaagcattagcagacgagcatggtagaatg 

tatgctgtacctgggggtgttctgaagaggaaggagcctgagggtgggtgggacagtgagaa 

cttcagattcaagcagtcatggcactaagatctgcaatctggtgattttataagctactgga 

ggatggacttggctaactcctcaaactctcagcttctggcatgctcctgtgggtgggcggta 

agtgagcaaatttgatgtgttcagagtctccgaccaccacctcttcagcttatcagtgtagt 

tgggatttccatggtttgcaagcactgcactttggtcagctatattctctgggtggatgcag 

atgagttttccctctgtagatatttaactgttggaaagcttgaaatctttgatgcccaggga 

ctggggataaatcaatgttatcctgtccaaattattgacaatggaggtccaatttcgagact 

gaatcaaacggaaagcttttctttgtgctttgctgttaatcatctttcaatgcttcccgtgt 

tcttggcttttctctgtcctcctttgcccattacatatgtatacagggttgacaccaaattt 

tggtactaatgctttcatcaggcatgttttagttgttgtggctgccattgtaccataaatta 

aatcgttctaacgttagtttgtagtctcattcacagatgatagaactcttgttaatgatatt 

ttcaatgatggtggggtgatgtgcttgtttttctttcaagctactaatctgaaccaacagtc 

ttgtgagcaacgaaaagacaacttctgttttctgatttggagaaattaaatgggtggagctt 

ttgcatgggttaaaaaaaaaaaaaaaaaaaaaaaaag 
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ccacgcgtccgcacaattcttctacagtacaagaaaccaaaaaaatggcgagtcttaaagtt 

ccaacatctgttccagaaccttatgaagatgctgagcaactcaaaaaagcttttgctggatg 

gggtacaaatgaggcacttattattcagattctggcacatagaaatgcagcacaacgcaagt 

taatccgagaaacttatgctgcagcttatggagaggatcttctcaaggacttggatgctgaa 

ctgacaagtgattttcagcgtgcagtgcttctgtggactttgagtcctgctgagcgcgacgc 

ctacttggttaatgaagctaccaaacgtctgacttctagcaattgggttatcttggaaattg 

cttgtacaaggtcttctgatgatctctttaaggcgaggcaggcctaccatgctcgatacaag 

aaatcacttgaagaagatgttgcttatcacacaactggggatttccgtaagcttttggttcc 

tcttttaactgcattcagatacgaaggagaagaggcgaacatgacattggcaagaaaggagg 

caaatar.actacacgagaagatctctgacaaggcttacaatgatgaggagctcatccgaatt 

atttctactaggagtaaagcacagctgaatgcaacattcaaccactaccttgaccaacatgg 

cagtgaaatcaacaaggatctggaaactgattctgatgatgagtacctgaaattactcagcg 

cagcaatagaatgettgaaaaccccagagaaacactttgagaaagttcttcgattggctatc 

aagggtacaggcacagacgaatgggaccttactagagttgtcactactcgggctgaagttga 

catggaacgtatcaaagaagagtaccataagaggaacagtgttccattggaccgtgcaattg 

ctggacacacttcaggagactatgaaaggatgcttctggctttgattgggcatggagatgct 

tgaatggaatatgtgttctaagattggataagaaactatttcctaatgtctgaagtttgaat 

ttgtttgatgatgtgtggcatgtatgcccagagtttggtttgcattatatgggatttaaata 

atccaggtgttgtgttttggtttttaaaaaaaaaaaaaaaa 
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ccacgcgttcgggataacatcattatccttctctcctcttcttccttctttcaaccacaatt 
ctcactcccctctttcgtctctcttctccaacttcaatcccattttcaggcaaaaagctgtc 
atggcttcaatttcagcagcttctgccacagctacagcttctacaaagcttgcataccctta 
ttccccttcttcctcaagcagcagcagcaacactgctgctgtattcccttcaaattcctcaa 
agcttatcctttcctcttcttttacacccaccccttcaacccttttcctccactcaccaaca 
actactccttccaccacccacccccgtcggttcactgtccgcgctgcacgtggcaaattcga 
gcgtaaaaaacctcacgtcaacattggtacaattggccacgttgaccatggaaagaccacac 
tcacagctgctttgaccatggcgcttgcctctatgggcaactccgcccccaagaaatatgac 
gaaattgatgctgcccctgaagaaagggcgcgtggtattactatcaacactgccactgtgga 
atatgaaacggaaaacagacattatgcacacgtggactgcccggggcatgctgattatgtca 
agaacatgattactggtgctgcccaaatggatggggcaattcttgttgtgtcaggtgctgat 
ggcccaatgccacagactaaagagcatattttgttagctaagcaagttggggtccctaatat 
ggttgttttcttgaacaaacaagaccaagttgatgatgaggagttacttgagcttgttgagt 
tggaggtaagagaattattgtcaagttatgagttccctggtgatgaaattcctattatttct 
ggttctgcacttttagctttagaggctttgatggctaatcctagtattaaaaggggtgaaaa 
tcaatgggttgataagatttatcaattgatggataatgttgatgaatatatccctatcccac 
aaagacaaactgaattgcctttcttgatggctattgaggatgttttctcgattaccggtaga 
ggtactgtggcgacggggagagtagagagagggactgttaaggttggggaaattgttgatat 
agttggattgaaggatactaggaatactacagtgacaggggttgagatgtttcagaagattt 
tggatgaagcgatggcgggagataatgtgggattgttgttgagaggtattcagaagattgat 
attcagagagggatggtgttggcgaagcccggaacaattactccgcacacaaagtttgaagc 
•tttggtgtatgtgttgaagaaggaagagggaggaaggcattccccgttctttgcgggttata 
ggcctcaattttacatgaggacaactgatgtgactggaaaggttactgtgattatgagtgac 
aaaggagaggaatctaagatggtcatgcctggcgatcgtgtaaacatggtggttgagcttat 
catgccggttgcatgtgagcaagggatgaggtttgctatcagggaaggaggaaagactgttg 
gagctggtgttattcagaaaatcttagaatgatgaacttgcagctgagcatctcttttcaca 
tgatcggcactttccattgaagttacttaatccattgtcatatatgcaacttcttggttact 
tttattatgtcttagaatcttactttagtagaagtatcctgttttaaacaccaaattctact 
gaacttttgggatttttcctcagtctcctctttcatttttcctttgcttgaaaggaatgaga 
acatttgatttcatgcactttatttaatttagaacaaatgtgcgactctgtttaaaattaag 
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ccacgcgtccggtttttagctctgtttttgacacctataaaatgcccctctgcttcattgaa 
ctatctccttcctcattctattgacacataggaagaagaggggcgacttgttgtgtaaaaga 
gaaaaaaaaaatgtatgcagagacagggctaatgttcccttattttcagactttcccttctg 
aagttcaacaatttgaagacttctgttcctctcacgaacctaatgcgtcaatgggatccaac 
atatcggaatatgacctcgggggagaaggggatctctttaaagctccacaaccaattattga 
agaaccattgatgggccttgatcctatgactgctgctatttcaatgatttcttgtgcagaag 
atgccatctcgccgcaaggactcaaagtttcggatctagaaacttcgtttgagaatgaacaa 
ctcctgagtgaagttttctatgaatgcaaaaaagacctatttgacaaagatgcaattgatat 
accgttctctgaagtcttggacatgaaaattcctattgtgaaggccgacgaaaacctgactg 
cagatgagaacttggtttctcaagtatctttccagaaaagtattagttcagggtctttaacc 
tccatggattggatacacggggcttcaatgaggcccaattttatagattttggtggaatgga 
ctttggagctgtttatggtatgcgaagagcatacagtgaaggagacataaagactctgggta 
atggcaacataaatctgatccattctccactgggtcaaccacagattgtcggatgctccact 
tctgaaattcgcaaggaaaagctctccagataccggagcaagaagaataaaaggaattttgg 
cagaaaaatcaagtatgcttgcaggaaggcattggctgatagtcaaccaaggatccgtggaa 
ggtttgccaagaccgaagaaagctacacatcgaagaagcattaacagttttaactgtctctg 
agttggaagaattatagtaaggtagttcactggttatattagctgatgatgatataaatagc 
aaatggaagctagctttagaacaggatctgctcaaataagttggggatccatccatccaaca 
acttgctagtttgttaaaatctttggggtagcggcaataatctttgtagattagacaaatca 
actagtgttgtatatagtgtttgttaaataaaattctgtagcttgctattaatgctggataa 
tgtatttccgatatctctatgttcagcggtccagaccgttactctgtatcttactgacgcaa 
caatttctgtatcttactgacgcaacaattataactatgcttcagtgtatcaag 

SEQIDN05 8 

ccacgcgtccgctcaataactaaatatatatatagctcagattaatttatcaagaccttgtg 
aagatgaagacttctacttttcttgcaatgttcttggtcttaacgttggttctccaagggga 
atttcaggcgagcgaggcagtgacatgcagtgcctcgcagctaagtgagtgtgtgggggcgg 
tgacgtcgtcacaggcaccatcttcggcatgttgcagcaaaatgagggaccaacagccttgt 
ctgtgtgggtacatgaaggatcccagcctgagacaatatgtcaatagtcctaatgctagaag 
ggttgctaacgcctgtggagttgccgttcccagttgttaaaatacttatgtgtgcaaaagta 
aaagcctttattaattactgttgcttgtactaagggaattataagcctatgttgttggcccc 
ttttacctaaataaaaaaggttgtgatgctaaaaaaaaaaaaaaag 
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ccacgcgtccgctcgcgttttagcagccactggagaaatcaaataggagagagaaggtagtt 

tctagagagagaaaccaaacaaaacaacaccagtttttagagagagaaaaaaagtaaaccgg 

actattctcgaagaaaattttccggtgactgtataaattattttcaagtgaagtttccatat 

ttgtacactcattgtcaattgattgcgttgccgtctccagattctccattaccgatttggta 

attaggttttcgagatcggttggttatcatccttcgattcgttaattcgggttaacaggaat 

tttttggtttcgattcgataatcgggttcaagtatttcagaaagagaacagaaaaaggaaaa 

aggaggtcttaaatctgtttggaagtgaagggggttttggttgaaagatgttgaccaccatg 

gttggtttgtgatgtaatatggcacgggttgttacagataaagatatgtcgttttacattgg 

tcgcgaggcttcaaagttgtggaagagattttgtgcggagataacaacagagatcaatcttc 

ttgctgagaattggaaatatattcttggcggtttgatttgtcagtacatccatggacttgcg 

gctagaggggtgcattactttcatcggcctggaccaattcttcaggacgtcggcttctatct 

tcttccggagcttagacaagatagagcttacataagtgaaactttatttaccaccatctttc 

tatcttttgtcttgtggaccttccatccttttatttttaagaccaaaaagatctatacagtt 

ctgatatggtgcaqagtcctggcattcttagtcggttgtcaattccttcggatcataacatt 

ctattctacgcagcttcctggtccaaattatcactgtcgtgagggttcaaagcttgccacgc 

ttcctcctcctgacaatattttagaagtgctattgattgttcctcggggcgtgctttatggt 

tgtggtgatctcatattttcttctcatatgatattctctcttgtctttgtgcggacatacca 

gaaatatggaacacgaaggttcataaaacagtgtgcttggttagctgttattgcacaaagct 

tattgattcttgcatcgcgcaaacattacactgttgatgttgtggtggcatggtacacagtc 

aaccttgtagtgttcttcattgataaaacgttgccagaacttcctgatcgcactagtgcctt 

gttgcttccagtgaccaaggatagcaagtccaaagaagagaatcacaaactgctgaacggaa 

attctggagatcctgcagaatggaggcctcgaaacgggaagatcgtggaagatgggaaaaca 

gtgcacgttgaagcagtaattaatggtgcatagacgataaacttcatgcaacaccactaact 

gatgcttgcgaccttggtacagagattggtaacaatgccattataagttgtgttaatataaa 

tcgttctgggtgttcttccaagttcaatagttttggttttagcgtaggatacgaaatcaaag 

attgagatgctatcgatgtctccacggtcctctgattttatcaaatgtatcatggaatttat 

tttattttttggttaatgcaatatttcccatccg 
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ccacgcgtccgatagatatagatacagaagagagagagagagaggtggtgggtgtgaatatg 
gtatagggcctgagacccctgaaagggcatggaagtgctggccatgctgaggcaactcattg 
gacaagttaaacaactcttacaacaacaaaatacacactctccttcttcttcctcctcctct 
tcttcttcttctaacttctcttttcctcttcaatcgccaccgcttttacacctcccaaggtg 
ttatgttctgaatcttgatgacagttctgctgaagacagttgctacaatatcattatgactg 
ctggaaaatctgaaaatctcaagatgttggaacctggcaagcctccaccaaaaaagaaagct 
cggaaggagaggaatcgaggaaaagtgactggaacttcatgctccatagagaatttggatcc 
gcaaatatggaaagaatttcctgaagacttatttgaagctgttattgcaagactaccaattg 
ccacttttttccgcttcagatctgtctgccgcaaatggaactcaatgctgatgtcccaaagt 
ttttctgaacagtgcacccaagttcctcaaccacaaccgtggttctacaccat tact cat ga 
aaacgtgaatactggagccatgtacgaccctatgttgaagaaatggcaccatcctactatac 
ctgcactgccgaccaagttgatagtcttgccagttgcttctgcaggaggtcttgtctgtttc 
cttgatattggacataggagcttctacgtatgcaaccctcttactaggtcctttaaagagtt 
accagccagatctgttaaggtgtggtctcgtgtggcagtagggatgacattgagtgggaaat 
cagcttacagtatcctttgggttggttgtgatggtgaatttgaagtttacgactccagaaag 
aactcttggactcgtccaggatctatgtcctcaaatgttaagcttcctatggcactcaactt 
caagtcgcagacagtcaccatcggtaataaattttactttatgcgctcagagcctgatggaa 
tcgtgtcctatgacatggttactgggatctggaagcagttcattatccctgcacccctacat 
ctgagtgatcatacactagcagaatgtgggggccgcataatgcttgtcgggctgctgacaaa 
gaatgcagccacttgcgtgtgcatatgggaactgcaaaagatgactcttttgtggaaggagg 
ttgacagaatgccaaatatatggtgcttggagttttatggaaagcacgttcggatgacttgc 
ttgggtaacaaaggtttgctcatgctatctttaagatcaagacaaatgaaccggctagtaac 
gtatgatttctcaaccggagaatggatgaaggtccccggttgcgtgttgccccgtgggagaa 
agaggcaatggatcgcgtgcgggactgcttttcacccccgtcttacagctttggcttaactt 
gggatgcccagtaaatttctagtcacagcagagtgcgatttattatatcatgtggttttagc 
ttttcccatcataatctgcagcctagtgttctctttgctgaatttattaccactctcttgta 
taaacatctagttgttaagcttttcattccagaggactaatctacgactacttattattaca 
ttaaaaaaaaaaaaaaagggcggcc 
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ccacgcgtccgccaatatcagatttctttcatgaactccacttccaatttctcattgcttct 

tcttcccatttccacctccaaagccatccttccagaaaaccttgttccttacatttcttagc 

cccaaaaaagattcccatctcaattccacaaaaaaacacaaggagatctaaggaaattcccc 

gcctctatatatagagaggtggaattgttcctgaatttggtttgaattgattgattgacaga 

ttttggtgagagggtgttattgaaaaaatgggtgacatgaaggataaagtcaaagggttcat 

gaaaaaagtcacatcttcttcttcaggtaagtttaaaggccaaggtagggttttgggtggtt 

catcttcttcaggaccctcaaatcatgtcaataatttttcatcacatcccctaaatacaagg 

caagatcaacaaccttcatatacaaaaacttcgcctcaaaaaccaagtaattctgatcaaag 

aattgagaatatatgtgaaattcagttcaacaaaagtgaatcaaaggatggttttgatccat 

ttggtgaattagtcacttctgggaagagaaacccaaaagggtattcacttactaatgtgttt 

gaatgccctgtctgtggtagtggttttgtttctgaagaagaggtgtcaactcatattgatag 

ctgtttaagttctgaagtgtcttctaatttgggagttgaaagtaaagttgaagttaaaagtg 

aattggaaacatgtgttagtgcatatgtttcagggaagccctcagaagggtcagttgaagtg 

gtcattaagttgttaaagaatattgtgaaggaaccagagaatgccaagtttaggaaaataag 

gatggggaatccaaaaataaaaggtgctataggtgatgttgtaggaggagtggagctattgg 

aatttgttggatttgagttgaaagaagaaggtggggaaatttgggctgtgatggatgttcct 

tctgaagaacaacttgttatgcttaagaatgtagtttcactcttggaaccgaagaaggttga 

agagttggcgtccttatcccaagttaaggcgagtgaaccagttgagccgaagaagattgata 

gacagattcgagtgttcttttctgttcccgagagcgtagcagcaaaaattgagctacctgat 

tccttctttaacctctcacgtgaggaattgagaagagaagcagagatgaggaagaagaaatt 

agaagattccaaattattgattcctaaatcttatcgggaaaagcaggcaaaagctgcaagaa 

agaagtacacaaaatccattatccgtgtacagtttccagatggagcattgcttcaaggtgtc 

tttctaccttcggagccaactagtgctctttatgagtttgtgagcgcagcgttaaaggaacc 

aagcttagagttcgaattgttacatccggtgcttgttaaaaagcgggtgattccccattttc 

cagctgctggggagagggctgtaacagttgaagaggaggatttggttcctgcagctctactc 

aaatttaaacctatcgaaacagattctgttgtttttactggtctttgtaatgagcttcttga 

aattagcgagcccctcgagaccggatcagttgcttcctcgtaagctctaaattacatcagac 

tttgaattcttctgagtgttggaaaccttataaaactctctgcgccgggaatgct 

SEQIDN062 

ccacgcgtccggactttctgaccttgtcaaaaacctctgtgtttctctcacatttctggtcc 
caatctcttgatatttattggagaagacgatggcagctccaccagctagggctcgagcagat 
tatgattatcttatcaagctcctcctcattggcgatagcggtgtgggaaagagttgtttgct 
tctgaggttctcagatggttccttcacaacaagtttcatcaccactattggaattgacttta 
agataagaacaattgaacttgatggcaagcggattaagttacaaatttgggatacagctggt 
caggagcgtttccgcactatcacgacagcgtattatcgaggagccatgggtattctgctggt 
gtacgatgtcacggacgagtcatct'ttcaataacatcaggaactggattcgcaacatagagc 
agcatgcttctgacaatgtcaataagattttggttgggaacaaggctgatatggacgaaagc 
aaaagggctgtgccaacttccaagggtcaagctcttgctgatgaatatggcattaagttttt 
tgaaacaagtgcaaagacaaacatgaatgtggaagaagttttcttttcaattgctagggata 
tcaaacaaaggctttcggaatctgattccaagactgagcctcaggcaatcaggatcaaccaa 
tcggatcaggccggaacttctggtcaagctgcacagaagtcatcttgctgtggttcgtgaat 
ggagacaatcgtgtgggaagaacgttcgttagttgcatttggatgtaaaaattgattgggat 
gaaaaactgattcctgttaacttcattaccaaatatttcttcgccatctgatggcaagcttg 
atgtgtcaaaggcttttctactgtcgttgtgaatctattgtcatgcagttaactagcctgcg 

ttttgataaaaaaaaaaaaaaaaaaag 
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ccacgcgtccgaataaatctgcttttggaaacattgtgttgcatccttctcttcagaggaga 
atagaacacctcgctagggccacagcaaacaccaagtctcaccaggcaccatttcgcaatat 
gctcttttatggtcctcctggcactgggaaaacaatggttgctagggagatcgcaagaaaat 
cgggtttggactatgccatgatgactggaggggatgttgcacccctgggtgcacaggctgtc 
accaaaattcacgagatattcgattgggccaaaaaatcaaataaaggcctactgcttttcat 
tgatgaggctgatgcatttttgtgcgagcggaatagtacatacatgagtgaagctcagcgaa 
gtgctttaaatgctttactctttcgaacaggggaccagtcccgagacgtagttcttgtcctt 
gcgaccaacaggccaggagatctagacagtgctgtcactgaccgtatagacgaagttatcga 
attccctctccctcaagaagaagagcgtttcaaattgctgaagctctatttgaacaagtacc 
ttgctggtgaaggagacagtgacagcaattctaagtgggggcacctcttcaagaagaaccaa 
caaaagaggataaccatacaagatttgtctgatgatgtgattagagaggctgctaagaagat 
agaaggattctctggccgtgagattgcaaaacttatggcaagtgttcaagcaactgtatatg 
ggagcccagattgtgttcttgattctcaactgttcaaggaaatcgtagattacaaggtcgct 
gagcatcaccaacgaataaaactagctgctgaaggtatggagccaacttaccaggggaatta 
actgacaccacaaagatacaagtgtctttcactgatacgaattgttgaaaatttgtttatta 
tctctttggtagtattgcatgcaaaattcattttttccaaacttaggatattgtagtttagg 
tgtactatttctgcttggggaatgagcactggatggtggacgtgtttcagggttcaatggga 
cgttacaatttgatgggtacatagctcacttgggctgtaattgtattgattctgtggatcgc 
aggaaaatacatccattgaatagataaatagtaggcaaaacatgaagtctctttgaaatagg 
tctctgttatcaaatatcaactaacctatcttttgattacc 

SEQIDN064 

ccacgcgtccgtatcttaatccgactccatctcctatctatctctcatacacttaacataaa 

tccacaatcaaattccccactataacacacacccaaattataaagagagaaatttttcgttc 

tgtggtgtttattattgtttgtgggttttgtaaataaatggggtcagaatcagatgagaggg 

aggtgatattgggtgtagatgggggcaccacctccactgtgtgtgtttgtatgccacttctt 

ctcttttccgaattccctgatcctcttccagtt'ctgggccgctccgttgctggttgttccaa 

ttttaatagcgttggagaagatgtagctagagaaacactggaaaaggttatggcagaagcat 

tgcttgatgctggtgtgaaacgatcagctgttaaagcagtgtgtttgggtctatccggtgtg 

aaccatccaacggatcaggagaaaatattaggctggttgaggagtgcattcccaagtcatgt 

taagttgtatattcagaatgatgccgtggctgctctagcaagtggcacgatgggaaaacttc 

atggctgtgttttaatagctggtacaggaagcatttcttatggatt'tactgacgatggaaga 

gaagctcgggccgcgggtgcagggcctgttttgggtgattgggggagtggctatgggattgc 

tgctcaagcattgattgcagtgatgagggctcatgatggtcgaggtccacaaacaatgcttt 

cgagttgtattctacagtcactaggtctttcttctccggacgaactaatagggtggacctat 

gcggatccatcttgggctcgcattgcagcacttgttccagtagttgtatcctgtgcagagga 

tggagatcaacttgcagacgagatcttacataatgcagttcaagaattggctataagtgtca 

aagctgttgtccaaagactacgcttggccggggaagatggaaaaggttccttccctgttgtt 

atggttggaggcgtacttggagccaacaataaatggaatatagggaatgaagtcactaattc 

tattttaaagacttatcctggagcttgtgtaattagaccaaaggtagagcctgcagttggag 

ctgctttattggccttaaatttcttgatgaaagaaacagtagctaatggccatagttgacac 

ctgattgtacatagctaactgtgttaactgtataatcattgaagttctctttaatcggtggt 

tccaattctgggagggcatgtccttggatcatggtactgtacttgccttctctttccattgc 

atatgcagactgctaaaaatgatctgttattcaaatgaacgttgcaccaacttgttgtaaca 

tatctttgtttcctaagttgggcagtcttttggtgctggaggagagggaaggagattgtttg 

gtcatagttgcatttgtattgctgatggttatatagaattcataactgatcagtatgttatg 

taatctcttttatagcattctctgttgggataaaaaaaaaaaaaaaaaaaaaaaaaaag 
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ccacgcgtccgcaaaatttagaaccccaaaacaatcagtactcctcactcccaatttggccc 
caatttgaattcaaaatctggaagcattcatgtgactttttcattttttcaaaaactctctc 
tcttctctccctccactctctgtgaaaccctagacacacactccatacgctctcgcaacctc 
tacctctctcttaaatcagcaaacgacagcgatctcatgacggcggtgccgcagtctgccgg 
aagagagctcgcaagcccacccaaggacggcatatccaacctccgattctccaatcacagtg 
atcaccttcttgtttcgtcttgggataagacggttcgtttgtacgatgcaagtgcgaatgcg 
ttgagaggagagttcatgcacggaggtccagttctcgattgttgcttccacgatgattcttc 
tggattcagtgctagtgctgataataccgttagaaggcttgtgttcaactatggaagagagg 
atatcttgggaaggcatgacgcaccagttcgctgcattgaatactcatatgcaaccggacaa 
gtgataactggcagctgggacaaaaccttgaaatgctgggatcccagaggtgcaagtggaca 
ggaacgtactcttgttggaacgtatacacaaccagagcgtgtttactctctttcccttgttg 
ggaaccgtttagtagtagcaactgctggaagacatgtgaatgtctatgacttgcggaacatg 
tctcaacctgaacaacggagggaatcttccttgaaatatcaaactagatgtgtgcgatgtta 
tcccaacggaacaggctatgctctaagttcfgttgaaggtcgggttgccatggaattttttg 
atctctctgaggccggtcagtccaagaaatatgcatttaaatgtcaccggaaaactgaagct 
ggaagggacatagtctaccctgtaaatgcaattgcgtttcaccctatctatggtactttcgc 
cactgggggttgtgatggttatgttaatgtctgggatggtaataacaaaaagaggctatatc 
agtaccctaaatatccttcaagcattgcagcattgtcatttagcagagatggtagactcctg 
gctgtagcatcaagttatacatttgaagagggagaaaagccccatgagccagatgccataat 
tgtccgcagcgtaaatgaagttgaagtgaagccaaagccaaaggttttgccgaatcctacct 
catgaaaactatttcagaagctcctcgatcctctcgagtcgactagtttatcttactttgga 
aaacaaaaaaactcttatgtacttaatatttcaatttgacttccaggactcatttctcgtag 
ctggaaattctggagaacagtgataaatttgtaattatccagttagcaattgtacctttttc 

gatgaaaaaaaaaaaaaaag 
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ccacgcgtccgtcgcctacatttgagcatgtcctgccccctcttcgtagctgcacctctcct 

ctgaaactcgagaaaaagtacaacaattttaagcttcagagatgctcgaagtatatagaagc 

agctcagttgagtggaagccgtcgccagtagtagccctagctactagcgccgacgattctca 

ggtcgccgcagctcgagaagacggctctcttgagatttggcttgtttctccttgctccgtcg 

gctggcactgtcaacttataatacatggaaatcctaattctagggtttcttcgttagtgtgg 

tgtaaatcggggtcgagaaggttggatgcaggtcggttattttcgtccagcattgatggatc 

agtttacgagtgggatcttttcgatttgactcagaaggctgtgctagattccattggtgttt 

caatatggcagatggctgtggaaccatgcaatacttcgcagcttcatcaaaatcttccaagg 

caqtatgagaatggccatgttaatcatacaaatggtgttagtagtgataatgagagcactga 

aggtgaagatgatgatgactcggttgttcttcatgaggatgatgatagtgaaaatggtcaaa 

ttgcatttgcttgtgacgatggttgcgttcgaatctacactactgatgagaagaatatgact 

tacaaaagatcattgcctagggtcagtgggcgtatattgagcgtcacttggagttctgatgc 

aaagaggatattttctggtagtagtgatgggtttataaga-tgttgggatgccaagttagcat 

atgaagtctataggataacagttggacttggaggtttgggtagtgaatctgaactatgcata 

tggtcattgcttgcgttgagatgtggtaccatggttagtgcagatagtactggtagcgttca 

gttctgggacacccatcatggcactcttttgcagtcacattcaagtcataaaggtgacgtga 

atgctttagcagcatcacccagccataggagggtgttctctgctggttctgatggtcaggtt 

ataatttataagctctcaaccagtgaggttgggtctcatgatggagatatttcttctgtaga 

catgaagaaatgggtttatgttggttatgtgagggcccatacacatgatgtgagggccttgg 

cagttgctgtacccattgctcatgaagagcccgtagctgaacataaggcaaagaagcagcgt 

tccggggagaagccccttgattttagttaccataaatgggc^icaattgggtgtaccgatgct 

tatctcaggtggtgatgacactaaactttttgcatactctgtaaaggaattcaccaggtttt 

ctccgcatgacatttgtccttcacctcagaggccacctatacaacttgcagtaaatacaatt 

ttcagtcaggcttctttactcttagtccaggctgcgtactggatagatattttttgtgttcg 

tgtaaaaaagggcgttgtgtctgatagctgtggccagtctggcggggctgcgagaacagatc 

tagtggctcgtgttaagtgcaaaacttcgaggaagatcacatgcagtgcaatttctccttca 

ggtgtaatgtttgcttattccgactatgtaaaaccctgtctttttgaacttaagaagagtgg 

tgctagcaagagtccatggactgtcagccgaaggcagctccctctgggactgccatttgccc 

attcaatggttttcagtgcagattcttctcgaatgatgatagcagggcgtgacagaaacatc 

tatgtggttgatgctgtaagcttggaactagttcatgttttcacacctcgtcgtcaagagca 

ttacgaagaattgctaccaaatgaacctcccattaccagaatgttcgctagtgccgatgggc 

agtggttagctgctgtcaactgctttggagatgtgtatatatttaatcttgagacgcagagg 

caacattggtttatatcaagattgaatggttcttctgttacagcgggtggttttactcctcg 

aaatagcaatgtgcttatagtatccacatcttcgaaccaagtatatgcctttgatgttgaag 

ctaagcaactaggagaatggtccaaccggaatacattctccctgccgagaagatttcaagaa 

tttcctggagaagtgattgggctttcttttgctccttctgctaattcatcatgtgtgattgt 

ctacagttcaagggcgatgtgcttgattgactttgggttgccagttggtgatgatgacgata 

ccaacttagctaatggtcaagatttagctttgacgaagctacatagtactcctgcgaatggg 

accttaaagcgcaagccgatagggaatgacttagatatgaaacaaaatggtagaaagaattt 

tgaattctgtgcattcagggatcctgttttgtttgttggacatctttcaagaacttccacct 

tgatcatagacaaaccctggattcaagtggttaaaactcttgatgcactacctgttcacaga 

cgtatttttgggacataaatctttatcacagtttttgttacagctttactaggaaacgttcc 

gaggggtgtattcaacccctttcactcatatattcttctttgttgtttgttgaagttcgggt 

ggggaaaaagttgaaatcaacactcaagttcaatatagcttcacttcatccgcaggagttct 

cctatggaaattgcgtagacctgtaaatatacttatgagctttaactagtgtccattagtct 

gttcagatattgattaatgttttcctgtataacatttattcaag 
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ccacgcgtccgcctctgagtacccttgagaagtcagagatcgatcttattcgctgccgggag 
atctgattacttcacttgttttgttcttttaggaaagatatcggatctgaccgtcaaacaaa 
gtaaaagatgcaggatcaggaagggcatgtggctgatgcaggaaaagaaacattgacatctg 
ttcaaacatctgaaattgaagattggacaaaatacaaggatgatgatattatgcaacagcaa 
tcttccatccaggctgaacaagctgtaaaaactcaatttgttggcgataaggaacctttgtc 
ttcattagaagctgaataccatctgggaaattcaattttgctggagaaaataaaggtgctga 
gtgaacaatatgctgtccttagaagaacacgtggagatggaaattgctttttccgcagtttc 
atgtttggttaccttgagcacattctggaatcacaagatcaaagcgaagttcatcgcattaa 
agctagtattgaggaatgcaaaaagacacttcaaagtttgggctacgcagaattcacatttg 
aagacttttttgcgttattcctcgagcaactcgataatgttcttcaaggtagcaaagattcc 
ataagtcatgaagaactcctacgcagaagtcgtgatccgtccatttctgactatgttgtgat 
gttcttcagatttgtaacatctggtgaaataaggaagcgctcggagtttttcgaaccattta 
tactaggactaacaaatgcctcagtggagcagttttgcaagtcatcagtggaacccatgggc 
gaagagagtgatcatgtgcagattatagccctatcagatgcgttgggtgtaccaatccgtgt 
cgtatatcttgatagaagctcatgtgagaacaacagcatcaatgtaaatcaccacgactttg 
ttcctacaagcgatggcatggggaatagtggtgtttccaagaccacaaatccatctattacc 
ttgctgtatcgcccaggacattacgacattctctaccccaagtgatgttcttcatttagggg 
tcgtttggtttgaatacagtttatgtcgggataagttatactggtataagttatgctgggat 
tagttatgctaggattgttttttatccattgtttggtatgttgtattaaatatgacaattgc 
at aa t ct gt aagaagattgtataccggt get aattaccccaccctcgataaggt at aagtt a 
tcccggtgttaattttaatcctgggataacttatacgtggtttgctaaccaaacgaagtatt 
aaggtggcat 

SEQIDN068 

ccacgcgtccggaagaaacgaagccggagaagagggctcttcttttcgtggagaagaacaat 
tataggagtattatcttatacttattcttaccaaagatggatcggtaccaaaaagtggagaa 
gccaagggcaggaacacccattgatgagaatgagattcggattactagtcagggtcgcatgc 
gcagctatatcacctatgctatgaccttgcttcaggaaaaaggatcagatgagattgtgttc 
aaggcaatgggcagggcaatcaacaagacagtgaccattgtggaattgattaagaggaggat 
tgttggtcttcaccaaataacgtctattacatccactgatattactgatacatgggaacccc 
ttgaagaaggccttctacctctcgaaaccaccaggcatgtctcaatgatcacaattaccctc 
tcaaaaaaggagctggatttgacttctgtggggtaccaaccaccattgccagcagaccaggt 
gaaagtgttgacagattttgactatgatggaggatcacctagtggtggacgaagaggccgcg 
gtggtagaggaaggggaaggtctagaggtttctcaggaaatggctttatgttggctgaatac 
gatgatggcgggtttgatcgcaatcggagctatggtaggggtaggggtcgaggcagaggtcg 
tagcttccgtggccgtggaaggggagggtacaatggtcctcaggatgcccagcaagacgctg 
acttctacaatcaagaagcacccatgcagggccgaggccgcggacggggaaggggaactcgt 
ggtaggggacgcggtttcagaactaatgggccgatccatggcggtggtgcttaaagatcaaa 
ctttgaagaatacagagattatgtgctatgagtgcctgctccatgttctatgtttttttt cc 
cttcagttgttacccgtgttaacagtaggttattgatctgtaatcagagtagactaattata 
gatttcattaccgcccgtatgtggtgagtttttttgttttttttcttgatatcttctagtat 
tttctttctggtagattaggtgcttgatcaagtgtaatttccttagtgagcagcacattctt 
taatttgtctgtgttagacatgttcagtgttgacctcagtgcgtaaatttgcctctgttttt 
agttggcagaatactcaaattacataatttctgctgcgttttatacttctttaactattgaa 
agtctttgcttttacaaaaaaaaaaaaaaag 
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ccacgcgtccgaacagaagctgatcttgtatatatcgtagacgatgacatgattccagggaa 

gaaaatgttgcaaattttagcacatgtagcagggatagacaagtacaagaattctgttttgg 

gaagcattggtaggattttgccatttagacagaaggattttacttttccgagctataggaag 

tttcgatcaaaggaagcagggctttatttgcctgatcctgcttataatatcactgttgacag 

aattgttcaggtggatttcctctccagttcttggtttctttcttctgaactagtcaagacac 

ttttcatcgaaacgcccttcactttcatgacaggagaagacttgcacttaagctatcagctt 

cagaagtatagaaatgctggatcatttgtgctgccagttgatccaaaggacaaagaaacttg 

gggtgacagtgagcacagacttgcttatgtatccgaaaccactgttatattcaaggacactg 

ttcaagtccgagacaatcaatg'gtggaaagcactctccactggttatgtaacacaatgggca 

gdaatgaatcctcagaaaattgatgcacttttctatgcccactctgtcgatgaagttaaagc 

tctcgcgcctcttcttgagaaattcaggtcaactgttggaaagaaggcctacattgttgtct 

caggaggcagcttctgcccgtgcgaagatgctgttacagctttgaactggcctaaggttgta 

tgcaaagaaagaagattcaagattatggatttaggagttggtgctctatcaagtatttcaaa 

ttcagaagtgcccgtcgttcaagcagtctatgctagcatgaaaggactaatcaacattcata 

acccgattcttgtgatcacggtagctgatgcagatcctcatgtgaagaaagcactcaagatg 

gctatagaagctaacaccaacagttcatctttagtccttttacctagatcatcggtcactaa 

gcttctttggatggctgatcttcggtccacagcattgccaaattggaatcgtatgaggcttt 

ccataaatatcatcacacagaatagagctaattcactagcaaggcttctcaaggctctcagc 

gacgcatactatataggcgatgaagttcctattactttcaacatggatagcaaagtggatga 

agcaactataaagcttgttaactcattcaattggcctcacggacctaaaagtcttcgaagaa 

gaatcatccaaggaggtctaattcgagctgttagtgagagttggtacccttcatcggatgat 

gattttggcctattactcgaagatgatatcgaagtctccccttactattacctctggatcaa 

atatgctgtcttggcctaccactatgaccctcaaatatcacttcctgaactctcatcgatct 

ctctttacacgccacggttggtggaagtggtaaaagaaaggcctaaatggaatgcaacagat 

ttcttcaagcaaattcatccaaacacaccttatctccaccaattgccttgtagttggggtgc 

agttttctttcccaagcaatggaaggaattctatgtttacatgaacgtgaggttcactgaag 

atccaaagcaaaatcctgttcagataccaaaatcaagaacaaatggttggcaagcttcttgg 

aaaaagtttttgatagatatgatgtacttaagagggtacgttagcctttatccgaactttcc 

aaatcaaacgagcttttcaacaaatcatatggaaccaggtgcacatattgctgctaaagaga 

atgtggttaagcataacaaggctgattttgaagtgccattgttaaaggaagatttcaagaac 

cttttgccaaatggaaaaatgcctccggtaacaaagttgccttcattgaacctcttcaatca 

gcctgtttctctaaagggattaaaagcagcaggagcaaaactagggaaagatgttattcaat 

gcagtccaacggagatagtagccgttcaccacgacacaggtttaccttcacattgtgcaaga 

ttctgaaaactccatactcgtccgatgatcacaaattaattcttttgttttctctcccaaat 

ttgccatgttacattacttggtggaaatgacagttaggaattggtgggagagaaagatgagg 

gtttgattcagctttatttctcatgcaagtaaggggaataaggattctttatgaatgactac 

tgatgagaatgtactcttgtaatattgcagccaaaattggctttctgtatcatcttcttttg 

cctcattttgcaatcaatgaaagtagacacatca 
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ccacgcgtccgtggtggcaaatcattctcttcttcgcggggttcttcatcttcgtcgagggg 
gtattctacgaggagagcagatcctagtttttcgtattcagttccctattatgcgccttctc 
cttttgggtttggtgggggtggtggggtttatgttggcccagctgttggttttgggtttggg 
tccagtgcctttcttatcatgatgggttttgctgcttttgttttggtttctgggtttctctc 
tgatcggtctgaagggggcagtgtgcttactgctactgacaaaactagtgttctcaagcttc 
aggttgggttgttgggcttgggtagatcactccaaaaggatctcaaccggattgcagaagta 
gcagatacatccacatcagagggtttaagctatgtgttgacagagacaacattagcattgct 
tcgacaccctgattattgcatctcagcttattcatctgttgatgtcaagaggagcatggaag 
aaggggagaatcgattcaatcaactttccattgaggagcgtggtaaatttgatgaagagaca 
cttgtgaatgtgaacaacattaaaaggaaaagttctacgagccagagggcaaatggatttag 
caatgaatacatagtggttacaatcttggtagctgctgaaggcgtttataaattgcctacta 
ttaatggaagtggagaattgaaagaagctttgcaaaagattgcatctattccttccagtaga 
acactagcagttgagattttatggaccccacagaacgaaaatgacacgttatcagaacgaga 
actccttgaggattaccctctcttgcggcctctgtaagaaaactgggatttcatgcttttct 
tttactttctaaagatcatataggctgctctcaaccactttttgttatcttcatgtatatag 
ctcgtagagcatcgataatacttgtgtaagaatgagaccaaattttcctaattgtactagta 
•aaattgttatataaaatgaccagattctccttaaaaaaaaaaaaaaaaaaaaa 

SEQIDN071 

tgtacaaaaagcggctggtaccggtccggaattcccgggatatcgtcgacccacgcgtccgc 
ccacgcgtgcgcaaattcgcggtgatgaagaaaatggttactcacaaagctatcaaacagta 
caaagaggacgttttgaaccctaataagaaagatttgactaaagaaaagctccccaaaaacg 
tgccttacgtttcgtctgcgcttttcttcaagtacaacacagctctgggaccgccttatcga 
gttctggtcgatactaactttatcaatttctccattcagaataaattggatttggagaaagg 
aatgatggattgtttgtatgccaaatgtactccgtgtataacagactgtgttatggctgagc 
tggacaagctgggtcagaagtaccgtgttgctcttagaattgcaaaagatccccgatttgaa 
aggcttccctgcactcacaaaggaacatatgctgatgattgtattgtcgagagagttactca 
acacaagtgctatattgtcgcaacatgtgatcgagatttgaagcgtagaatacgcaaggtcc 
ctggtgtaccaatcatgtacattactcaacataaatactccattgaaaggttgcctgaagca 
acaatcggtggagctccaagatattgagtacgtgtttcgagcagtcaaacaatggaatttcc 
aagaccttggatagtggttcgaattcccatcacggctgtcgctgcatagattaccagatctc 
ggtgcgttgtgcaacgaaaaatgctgaagtatcagtcgaatctcaattttgtacccggtgga 
ttgttatgtgttcctcaatgataaagaaatatgttcgattttgtttagttagtatctctagg 
tgctgcccccgtgtgtcttaattaaacagccaatagcggtgtcctaaggcattccaaacaga 
actataatccatgcctcctttaatgtgtaagggggtggttatcaac 
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gtacaaaaaagcaggctggtaccggtccggaattcccgggatatcgtcgacccacgcgtccg 
cccacgcgtccgcaaattcgcggtgatgaagaaaatggttactcacaaagctatcaaacagt 
acaaagaggacgttttgaaccctaataagaaagatttgactaaagaaaagctccccaaaaac 
gtgccttacgtttcgtctgcgcttttcttcaagtacaacacagctctgggaccgccttatcg 
agttctggtcgatactaactttatcaatttctccattcagaataaattggatttggagaaag 
gaatgatggattgtttgtatgccaaatgtactccgtgtataacagactgtgttatggctgag 
ctggagaagctgggtcagaagtaccgtgttgctcttagaattgcaaaagatccccgatttga 
aaggcttccctgcactcacaaaggaacatatgctgatgattgtattgtcgagagagttactc 
aacacaagtgctatattgtcgcaacatgtgatcgagatttgaagcgtagaatacgcaaggtc 
cctggtgtaccaatcatgtacattactcaacataaatactccattgaaaggttgcctgaagc 
aacaatcggtggagctccaagatattgagtacgtgtttcgagcagtcaaacaatggaatttc 
caagaccttggatagtggttcgaattcccatcacggctgtcgctgcatagattaccagatct 
cggtgcgttgtgcaacgaaaaatgctgaagtatcagtcgaatctcaattttgtacccggtgg 
attgttatgtgttcctcaatgataaagaaatatgttcgattttgtttagttagtatctctag 
gtgctgcccccgtgtgtcttaattaaacagccaatagcggtgtcctaaggcattccaaacag 
aactataatccatgcctcctttaatgtgtaagggggtgttatcaaccttgt 

SEQIDN07 3 

ccacgcgtccgcgaggcaacagatgaagcaggtgtgttgttaactatgagcacgttgactga 

agatggcgtgatttcggtgaagaatgcagcttgtgagaggttactgaatcagagggtggaat 

tgaaaatgaagtcgaaaaagttgaatgactgcttgaaccgcttccatgttgctatgccaaaa 

ccacgtgaccagaaagagaggccagcatgcatacctcaggcagtgttggaagccagagctaa 

ggaggctgaggcagatgctgagaaacagaaaaggaaacttgagagagatctggagaatgaga 

acgggggtgcaggtgtttactctgccagcttgaggaagcactatctattagcaaaagaagag 

tggaaggaagatgtaatgccagaaattttagatgggcacaatgtctacgactttattgaccc 

tgatatcttacaaaggcttgaagaattggagagagaagaaggtcttcgtcaggatgaagaag 

gagatgatgattttgagatggacggcgttgagctgacccctgaagaacaagcagcattagct 

gaaattcggaaacagaagagtttgctcattcaacagcatagaattaagaaaagcaccgcaga 

gagccgacccactgtaccaagaaagtttgacaaagacaaggagttcacttcaaaaagaatgg 

gaaggcagttatctgctttggggctggatccaactctagcaatcaatcgagcccgtagtaga 

tcaaggggtcgtaagcgagagagatcagttgaacgtggagatgacattggtaaggatgcaat 

ggatgtcgacaagattactcccaacaagaagcaaagattgaggtcactttccattacggcaa 

gatcaaggtcaaggtcacgacctccagatgaatttgttccaggggagggcttaaaggacaaa 

gcccaaaagaagatggctataaagatggctaagggttcttctaagaagaggaataaggatgc 

tcggcggggagaggctgatagagttattcctactctgaaaccaaaacatctcttctcaggaa 

agcgatcaactgggaaaactgaccggcgctagtaaaccaagatggcattttatcttggaatt 

tgctgatggtacctgtcaagatgcttgtgttgcaatatcttgggtggcggacagaaaggcta 

aaagaaaactcagcttgtgaggaagatgtcaagaattcaatctattgaaatggcaagaccaa 

gactacagattaagtatttaagtttgtgcttaagatgcagctgaacttgctgcctctattat 

gcatttttggaacttagatacctgttgtaagattgtgtttatcccgatgttaaattttgtct 

cagatttttttgattttctttagtacagcctttcctctcttttttgcatcaactttctgttt 

acacgccctaaaaggcgtattcagaaaatgtattcatctgccaatctccttgggatgttttt 

tttttttgggaa 
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ccacgcgtccgattgttaattactgcttctgtccccacaccacttaagagcacctcattcat 
ggcttctcccaactcactcaccactttctgcattatccagtgctcattttgctgtaaactac 
agttatttcttagctgaaaatccaagatttgctgttaattcttgacccttttgccccttctt 
ggattttctgttatttttggattcttttttgtgtcttgaagcaaaggaaggcagaaaatgag 
aggaggggtcagtggaagtttcaaacttgagcttctggttgtatttatactgcttctttgta 
tccgagactccaattgcagctcactgaagcatctaaaaggctctctattcaaggacataaag 
gaggacactcttttgccagagatctccccaaatgctgctccacagccccttcttcccctatt 
tgcaccttctccattggcacctttcacaaacagcactttacccaaattatctggactctgta 
cgcttaactttgatgctgtgagaagtatgatgaccgtgacatcaatagattgtgtagcacca 
tttgcacagtatctggctaatgtcatgtgctgccctcaactggaaacaactcttgttattct 
tattgggcggtctagtaaaaaaacaaatatgcttgcattaaatgggaccctcgcaaagcatt 
gcctttcagattttcagcaacttctggtgagccaaggtgccaatgatactttgcagcatata 
tgctctctccatccgtctaatcttactcaaggttcttgcccggtcaaagatgttcatgagtt 
tgagacgactgtggactcgtctagcctacttgctgcctgtggcaagatcgatcttgtgaatg 
aatgctgtgagcaaacctgccaaaatgctatatcagaagctgctaaaaaacttgcacttaaa 
gcatatgatcttttaagcatggatggctctcatgtgctggctgatcacacgaccagagttaa 
cgactgtaaaagtattgtacaccgatggttggcaagtaaacttgaccctgctggagcaaaag 
atgttcttagaggactttctaattgcaaaaacaataaagtgtgccctctggcttttcctggc 
atgaaaaatattacaaaggcttgtggagacgggatgaataaccaatcaatatgctgtaatac 
tgttgagaggtatgtctctcacttacaaaggcagagcttcgtcaccaacttgcaagctttgg 
attgtgctgcttcacttggtcttaagctacagaaagccaatgttagcaaaaatgtctacaat 
ctctgtcacattagcctcaaggatttttccgtacaagttgcaccagaagtttcggggtgtct 
tttgcctagtttaccgtcggatgcaatactggaccaaagtacggggatcagttttgtctgcg 
acttaaatgacaatattccggctccttggccatctatgtctcagttaccagcttcgtcatgc 
aataagtctgtgagaattcccgcacttcctgctgcagcatcgggccaaatcagtaaaggatt 
aaatatatggtcacatatgctactgatggcgtcgatgatattgggaatctgctgtatatcta 
atgctgccaatcttgcttattagctgtattttgtggaagcacattttgaccagaaagaaaat 
tcaaaaattacagttctatgaaggtctctgattgacatcaaaacttaaaatgtacagatgca 
ggaaaatcatgcacctgagtgaaaatccaactcagagatgattccaagatcaaattcgGgac 
gaaatttttattccctttctttgggcaataagaaagttgtgaaaaaaattacacagcaggtt 
tagtttcatgtaattatttccacttgacatactttgcctttatgtatttggaattcctcaga 
aaaaaaaaaaaaaaaaaagggcggccgctctagag 
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acagtttgtacaaaaaagcaggctggtaccggtccggaattcccgggatatcgtcgacccac 
gcgtccgggatcaagaccctatattgcgttatggtggaatgtatgctttagcattggcttac 
agaggaactgcgaataataaagctatccgtcagttgctgcattttgctgtatcagatgttag 
tgatgatgtccgccggacagcagttttggcacttggatttgttatgtattctgagccagagc 
agatgcctcgtattgtatcgttgttatcagagtcttacaatccacatgttcgatatggtgcg 
gctatggcagtaggcatttcttgtgcaggtactggtctgagtgaggccatctcattgttgga 
gcctttgacatcagatgtggttgattttgtacgtcaaggtgctctcatagcgatggccatgg 
tgatggtccagataagtgaagctagtgattcccgcgttggtgccttcaggcgacaactggag 
aaaattgtcctagataagcatgaagataccatgagtaaaatgggtgcaattttggcctctgg 
tattcttgatgctggtggaagaaacgtgacaatcaagttactttcaaagactaaacatgaca 
aaattacagcagtcgttggactagctgtttttagtcagttttggtattggtatccacttata 
tatttcgttagcttagcattctcaccaacagccttgattggtctcaattatgacctaaaagt 
gccaaagttcgagtttgtatcacacgctaagccctcactatttgagtatcctaagccaacca 
ctgtagccaccacaacttctgctgtgaaacttcccacagctgttttatcaacatcggctagg 
gctaaggcaagggctagcaagaaagaggctgagaaagccattgccgagaaggcagctggaac 
agagtcatcttctggtgcaccaagttctggggagtccatgcaggtggatactccagcggaga 
agaaaaatgaaccagagccatcatttgagatgttgaccaaccctgctagggtggttccagct 
caggagaaatacataaagtttttggaagaaagcagatatgtgccagttaaatcatcaccttc 
tggatttgtgcttctgagagatctacgtcctgatgaacctgaaatattgtccctcactgatg 
caccctcgtGaactgcatccagcactggtggtggatcaactggacaacaggccccggcatca 
gcaatggctgttgatgaggagcctcagccaccaccggcatttgagtacacatcgtgatttat 
ttgtattttaaaagcttcaccaatactttggttttcattccattttggagacgatgttgaat 
ggcagaggtggaaacctatggatcaaatagcacttcctatgatcgagttgaattgtgggata 
cattgaaaagagccccgtggatactgttattctgcctcttgatttccagacttgtgcttgtg 
cttgtcattgtatttcctatgcaagagggactcaaaaactggggactggaaactgccattgc 
gcgttatctttttctgaatctgtcacgtcagctctgtctggactgttagatttttactttat 
gttctaattaagattttatattgttcggatctacaaaaagatttccactgttctccccgagt 
atttatagtcc 
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ccacgcgtccgtaaaaccctgcggcctatcggtttatcttctccctccatttccactgtacc 
atacaatacaatggccaccacttcccttagaacgcccaccactaccgtaaggccgccgtcca 
cccccgtcagcgcctccgcggtgaaacccaattgtatcactttcttatcctacttacaccgc 
agacgggggcagactgcactacttccccgacggtgtcgtattca'ccactccgctaacacaat 
tgttcagctgccgcaccggtcggttgagaaatttatagtctttgcttcaaatggggatgctg 
ctgaggccgctcaaaccgagactcaggaacctgagcaggaggtacaagaatccgagcaggag 
gagaatgtagatggtgctgctgctgaagatgcttcggatgagggtgacaatgcagctgcaga 
tgaaactgcatcattcattgcaacttcattgcagttgtacagagatgctttagcaaataatg 
acgattcaaaagttgcagagatagaaatttccctcaagtccatagaagaagagaaaattgaa 
cttcagagaaaagtagcctcattgaccgaagaactgtcaagtgagagcgaccgggttcttag 
aatcagcgctgactitccacaatttccgtaagagaacagagagagaaagactttctcttgtga 
agaatgcacaagggqaagttgtcgagaaacttctatctgttctggacaattttgagagggcg 
aaaatgcaaatcaaggtggcaacagagggagaagagaaaattaataatagttatcagagcat 
ttctaaacaatttggggaaatccttggatctcttggtgttgagactgtggagacagttggga 
agccattcgacccattgctgcacgaagctataatgcgtgaggattcagaggaatttgaagaa 
ggtgttgtattagaagaatatcgcaaaggtttcaaacttggagacagactcttacgtccttc 
aatggtgaaggtgtcggctggcccagggccggcaaagccagagacagcggagcctaaagaag 
agcaaaacgaagtcgaggagaagagtgaggaaggtactgctgaaacagcaggtgatgaaggg 
acaggtgaaggaggtaactaactaccagtgatgatgtgacaagtgagggatgtaacctgtga 
tttctcttttgtacaagcaaagaaaaggacatatttcctggtttgattgaggttgagatagg 
tttttgctggtatacctttcaattttcattaactactgtttatctgaaaggacatcatttta 
ggtcagtcggcttatgactgctgtcttaaacactattttttgaggctttggatagttgagga 
ttcatatagtcgatcccaactagcttgggatcgaggcgcaattgttgtaatactccggaaac 
aagagcgtaatgtcatatgccagactgaca 

SEQIDN077 

ccacgcgtccgaaatatgccagggattatttctcgtccgagatgatgttgatgggcaattac 
tacaataaccttggaatgaatttcaatgtaaataataatggcggcggaggaggagggatgtt 
gttttctgggaatccaagtgcgatgacaaacagtggacggagtagcatcaataattcagtaa 
tgagtcagtctggaggttgttcgagttctttttttatcgattcagtgccggggctcaagcat 
gatactgggctggcggtagagtggacccttgaggaacagtacaaactggatgaaggacttat 
caagttcgcgaatgaacccagtataatgaagtatattaagattgcagccgcgctccgtgaca 
aaactgtacgtgatgttgcattaagatgtaggtggatgacgagaaagcgcagaaaacaggag 
gactatagtttggggaagaaagtgaaagacaggaaggataaatcagcagaagcatccatgaa 
aactggtacatcctcagcttcgccattgagcttcattccatattcactctcctcaaatcatc 
gtaaccatggtgaaaatatcccttctgcagcattacttggaacgagacatctactggaagaa 
aacaatcaggctctcaatcagatttcggccaacctttcaacagtGaagttgcaggacaacat 
tgatctcttcatccgaacgagaaataatataacggcagttttaaacgacatgagaaatatgc 
cagggattatgagccaaatgccaccccttccagttttgttgaatgaggaacttgctagtagt 
gttttgcctagtatgactcagccgatgatgtttggctccacaagtggaatccagctgaaaca 
agagccaggctgctgatgcaaaacgcttggtgttaaatttggattactagcttgtgtaagta 
caccaaattttttgctgtaaatgcataaaaagctggcaggtctttgcagcttgggtatacga 
ctgggttccacgggaagaacatttatgagaacctgttttttggaag'ctgaacatctgaacac 
aagcaccaggaaatagcagcctcgtgttattgcatatcaggggaaaaactgttatcttgata 
ctgcacttacaagcatttttcttcttcttgtttcagccttctgtgtgtaaatttaggggata 
aatcgatctcaaaatcgatt 
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ccacgcgtccgcggacgcgtgggcgcgaactcagattctcaataatggcggcatcgtacgag 

tacgaagacggaggtttccaacagcaaccggattcagccgggtacgacccgaattttgtgcc 

ggattcggtaaaatcgttcgtggttcatctgtacaggcacattagggagaaaaatgtttacg 

agattcaccagatgtacgagtcctcttttcagactttaagcgagcgtatgttcaaggaaact 

ccatggccttcagttgatgccgttgcgccttacgttgataacgaccacgttttctgcttgct 

ataccgtgaaatgtggttccgtcacttgtacgctaggctttctcctactcttaaacagcgga 

ttgattcttgggataattattgcagcctttttcaggttgtgctgcatggtgtggtgaacatg 

caattgccaaatcagtggttgtgggacatggtagatgagtttgtataccaattccaggcatt 

ctgtcaataccgtgcaaagatg'aagaacaaaactgcggaggagattgcattgctgaagcaat 

atgaccaggcttggaatgtctacggtgtcctcaacttcttacaagcccttgtagagaaatct 

acgataatccaaatattggagagggagaaggaaggtcttgaagagtttactgctactgatgg 

gtatgattacagtggtggaagtaatgtcttgaaggttttgggttatttcagcatgataggct 

tgctcagagttcattgtctgttgggtgattatcatactggcctgaagtgcttgcgtccaatt 

gacataactcaacaaggtgtttacaccagtgttattgggagccacataaccacaatttatca 

ctacggctttgctaatcttatgttgaggaggtatgtagacgctatccaagaatttaacaaaa 

tccttctatatatttataagacaaagcagtatcaccagaagtcaccccagtacgagcagata 

ctgaagaaaaatgagcagatgtatgctctgttggccatatctttgtcactgtgccctcaagt 

gaaacttgttgaagaaactgtcaattctcaattaagggagaagtatggtgagaagatggcga 

gaatgcaaagatatgatgatgaggctt'ttgccctctatgatgaactcttctcatatgcatgt 

ccaaagttcattactccctctgctccaagttttgaggagcctcttgtaaattacaaccagga 

tgcgtataggctacagttgaagctcttcctttatgaagtgaagcagcaacaattgttagctg 

gtgttaggacctttttgaaagtctattcaacaatctccctggggaagcttgcaaactacatg 

gaagtggatgaacccactttaaggacaattttgatgacatacaagcacaaaacacatgctgt 

cgattccgatgggaagataacttctaatgctgatgtggacttctacattgatgaagacatga 

tccgcgtagtagaatctaaacccgccaagaagtatggagattacttcttgcgtcagattgtg 

aagcttgaagggatcatgactgatattgacaggataaagctggagtaagctatcttcctatg 

ttctagtattagtgctagcttattttgagctttcatttttgtactcgaaagcaagaaggaaa 

atgcataaagtggaaaaagtatacattttgttgttccccctctgagactgtgttaccggaag 

ttgttgataaatgaccagttaaatccatttttttctaaaaaaaaaaaaaaaaag 
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ccacgcgtccgcccacgcgtccgcttttccatcagatttcagctcttttactccacagctgc 

agcaactacattggtggactttctggtaaattcacttggcttctccaacaaagaagccattt 

ctacaagctccaaggtaactcgttcgacactccgaaattatgagccacaattgttacttgat 

ctctttcacaaagtgggtatgaataaaacccagatcaaaaccctcgtttcttcttcccctga 

attgttgttttctcatattgataaaaaccttaaacccaaaattatggttttacaagaaattg 

gcttatctgggtctgaccttgttacatttatcaataaaagcgatttcttgatgagaggttta 

catactactattaaaccaagtcttgattatcttcgggagtatttgggcagttatgatgctgt 

agctagggttattaagaaagagcctaggctgctttccagtaatctccctaaagtaataccac 

ccaatatactattgttgcaaaatcttgggttttcgctaggggatattgagacggtttttcat 

cggcgtcctaggtatctgcttaataaccctgagtggcttgagagagtagtaaatcaagcaga 

aaagagttttaacgtacctcgggagtcacggatgtttcttcatgccattgaagcacttgtgt 

cgcttgatgaatcaaaattagaaaggaaattagatattttccggagttttggatggtctgat 

tctgatatctgtgcaatggtgcgaaaacttccttactgtttgacttcatcagaggctaagat 

aagaagtacattgaaatttttcatgaccgaacttgggtatgaacctagttatctggcttctc 

atgcaccacttttaaagtacagtatggagaagagggtcaagccaaggaatgaaatcttgaag 

tttcttaaagaaaaccagctgataaaagggaaactaagtctttacactgccgtgtcatctcc 

tgaatcacgatttcgtaagaaatatgttcttcctttcaaggagaagatgcctgagttgtatg 

atttatacatcaaaaatacaagctaaagagaggtcttcacagtgtgacagtggctgcagagc 

agtgcttgtttaagaggtttattcacttcttgataattttgtactttcattttggtgctctt 

ttcaagcatgttgctagtttacctttcattgttgattatacatttatcaaaaaattactgag 

ctatgaaaactagaaattgaggctagtctcattttcaaatcaactgatgtttcttgtttaat 

gggaaggaaagaagtgtagaaaccagacttgatgtatatgccattgattataaaaaaaaaaa 

aaaaaaaaaaaaaaaag 
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ccacgcgtccgcccacgcgtccgaaaaaaagtaaaccggactattctcgaagaaaattttcc 

ggcgactgtgtaaattattttccagtgaagtttccatatttgtatactcattgtcaattgat 

tgcgttgccgtgttcagattctccattaccgatttggtaattaggtttcgagatcggttgtt 

tatcattcttctattcgttaattcgggttaacaggaattttttgatttcgattcgataatcg 

ggttcaagtatttcagaaagagaaacaaaaaggaaaaaggaggtctaaaatttgtttcaaag 

tgaagggggtttaggttgaaagatgttgaccaccacggttggtttgtgatataatatggcac 

gggttgttacagataaagatatgtcgttttacattggtcgcgaggcttcaaagttgtggaag 

agattttgtgcggagataacaacagaaatcaatcttcttgctgagaattggaagtatattct 

tggcggtttgatttgtcagtac'atccatggacttgctgctagaggtgtgcattactttcatc 

ggcctggaccaattcttcaggacgtcggcttctatcttcttccggagcttggacaagataga 

gcttacataagtgaaactgtatttaccaccatctttctaacttttgtcttgtggaccttcca 

cccttttattttcaaoaccaaaaagatctatacagttctgatatggtgcagggtcctggcat 

tcttagtcggttgtcaattccttcggatcataacattctattctacacagcttcctggtcca 

aattatcactgtcqtgagggttcaaagctt'gccacgcttcctcctcctgacaatattttaga 

agtgctattgattgttcctcggggcgtgctttatggttgtggtgatctgatattttcatctc 

atatgatattctctctagtctttgtgcggacataccagaaatatggaacacgaaggtttata 

aaacagtgtgcttggttagctgttattgcacaaagcttattgattcttgcatcgcgcaagca 

ttacactgttgatgtagttgtggcatggtacacagtcaaccttgtagtgttcttcattgata 

aaacgttaccagaactgcctgatcgcactagtgccttgttgcttccagtgaccaaggatagc 

aagtctaaagaagagaatcacaaactgctgaatgggaattctggagatcctgcagaatggag 

gcctcgaaacggqaagatcgtggaagatgggaaagcagtgcacgttgaagcagtaattaatg 

gtgcatagacgacccactaactgatgcttgcaaccttggtacagagattggtaacaacgcca 

ttacaagttgtgttaatataaatcattcctggtgctcttccaagttcaatagttttggtttt 

agcgtaggatacgaaatcaagtcaaggattgaaatgctatggatgtctccacggtcccctgt 

ggttaaatttaatgttatcaaatgtatcatggaattcattttattttttggttaaaagcaat 

tattttcttatttccaaaaaaaaaaaaaaaggg 
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ccacgcgtccgcccacgcgtccggcaacaacagcaacagcaactcttgaaggctattcctca 



aacc 
ca 
aaa 
ttc 
acc 
a 



gcagagaaacccacttcaaccgcaatttcaaccacagaatcatgctataaggtctcctgtaa 
ccagcttatgagcctgggatgtgtgcccgtcggctgactcattatttgtatcagcagcaa 
cagacctgaagacaataacatagagttctggagaaaatttgtcgccgagtattttgctcc 
_atgccaagaaaaagtggtgcgtctctatgtatggaagtggccggcagaccactggagttt 
ttcctcaggatgtatggcactgtgaaatatgcaaccgcaagccaggccgtggttttgaagcg 
ccqctgaagtcttgcccaggcttttcaagataaaatatgaaagtgggaccttggaagagct 
a ctctatattgatatgcctcgtgaatatcagaattcatctggacaaattgtcctagactatg 
caaaagcaattcaggagagtgtttttgagcaacttcgcgttgtacgtgatggtcagcttcga 
atagtgttttcacagcctgatctaaagatcatctcttgggaattttgtgctcgacgtcatga 
ggagctaatccctagaagattgttgatacctcaggtgagtcaactcggcgctgcagctcaaa 
aqtaccaggcagcaacccaaaatggatcatctactgcatctgtttctgagttgcagaataac 
tgcaatatgtttgttgcctcagctcgtcagttggcaaaagctttggaagttccattggtaaa 
tgatctaggttatacaaagagatatgtgcgatgccttcaggtatgcaccttgttctgatgct 
qgaaggtttttattttggcccttttctggaatttggagacattccgcttgtatcaatgtgga 
tatcactacaaattcttgaaatatttgcttctgttagtgcttttaacttcccgaccaggtca 
tggtttgcctgttctgtgcgtgatatctgactcaaccagttctatccaactttcttatcctc 
tccggcccccctctccttttaatctgtcatcttcccgtggaattcaaagcaaagctgaaaat 
gaaggtcgaaattcagatttctagcatgtagcagctcagacaaccaagaggttgtgagttcg 
agtcacccaagagcagggtagggagttattggagggagggagccgagggtctatcggaaaca 
gcctctctacccaccccagggtagggctaaggtctgcacaaactaccctccccagaccccac 
tagtgggattatactaggttgttgttgtaatattggaaattcctactggtaaatctgactca 

tgagttaatgtgtgaagtagacggaatgatgtggcca 
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ccacgcgtccgctgacgcgtgggttccactacatcaagacatctactacactcatctttttt 
gcacttattgggtgtaaatttttgaaacccagttgagaaaaatgagtgtgttacaataccca 
gaagggattgacccagcagatgttcagatatggaacaatgcagcatttgataatggagattc 
tgaagatttgtcttcgctgaaacgttcttggtctcctctgaaacccctttcggttaggccat 
cagattcctttgaatctgatttgtcaagtaaggaaaatcaaactcctttatttgagaattca 
tctgttaatctctcatctccgttacccataaagccacttaaccctaatggggctctggaaaa 
ttcaagactcaagccgaacaagcccaattccaaacagagtcttgatgagatggcggctagaa 
agagcggaaagggaaatgatttccgtgatgagaagaaaatagacgaggaaattgaagaaatt 
cagatggagattagtaggttgagttcaagattagaggctttgagaattgaaaaggctgagaa 
aactgttgctaagactgttgaaaagcgaggaagggttgtggcagcaaagtttatggagccaa 
aacaaagtgttattaagattgaagagcgtatatcaatgagtgcaagaacaaaggtggagcag 
agaaggggtcttagtttaggaccatctgagatttttactggaacgcggcggcgagggttgag 
tatggggccatcagatattctagcagggacaacaaaggcacggcaattgggaaagcaagaga 
tgattattactcctattcagccaatacaaaacaggcgaaagtcgtgtttttggaagcttcaa 
gagattgaagaagagggaaaaagttcaagccttagtcctaaatcaagaaaaactgctg.caag 
aacaatggttacaacaaggcaggcagttactacaattgcatcaaagaagaatttgaaaaaag 
atgatggacttttgagttcagttcagccaaagaagttgtttaaagatctcgaaaagtctgct 
gctgctaataagaagccccagaggccggggagggttgtggctagtaggtataatcagagtac 
aattcagtcatcagtagtgagaaagaggtctttacctgaaaatgataaggatgagagtaaga 
gaaatgataagaaacggtcgttatctgtagggaaaacgcgtgtgtctcaaactgagagcaag 
aatttgggtactgaaagtagggtgaaaaagagatgggaaattcctagtgagattgtagttca 
tggaaacacagagagtgagaaatctccactaagcattattgtgaagcctgatttgcttccgc 
gaattaggattgctcggtgtgtgaatgagactcttagggattctggacctgctaaaagaatg 
atagagttgataggcaagaaatcgtttttcagtagtgatgaagataaggagccacctgtctg 
tcaagttttaagttttgcagaggaagatgctgaagaggaataatgtgtaataaagggagctg 
ctaactcttttcatgctctttcaattttcaatcctgccttttaatttttgttcattcgtgcc 
ttttaattgaatggggaagcattcttttgcttcctcaaactggtattctagcttctgaatta 
cattgtatggtacaatatgaataaggttttgtcttccggcaggttgtccaagttagttttta 
gcttaaaatagatgcggca 

SEQIDN083 

ccacgcgtccgctttcacaaagcattgtgtgttctgatgggatggaactatctggcacctcc 
t gataactcgaagt cat atcagatgttatctgctgatgaagcaactgcaaaccgagat gate 
tagttttgtggccccctttggtaataatccacaacactatcacaggaaaacgtgatgatggc 
cgcatggagggtttgggaaacaaggcaatggatagttaccttagaggtattggatttcacaa 
tggaaaggtgaaggccttgtataacagagaaggtcatctaggtgttactctggttaaatttc 
caagtttaatggatgccatgcggttagcggaatattttgagaaagataaccgcgggagaaaa 
ggttgggctcgactgcagcccgtgactctaggcaaggatgacgagaacaaccctgaccttgt 
caaggttgatcataggactggagagaagaagagagtcttctatggttatctgggaactgtta 
gtgatttagagaaggttgatttcgactctcgaaagaagattaccattgctagccgatcagat 
tatgtgacatctggttagaaccacttgaatagctttacattaagatgtgcttcagttgagaa 
ttttagtcaattccctgctctagatattctggctttgtgttacttttattgcccttaggaat 
tggggcagctttctctgggataactgtggagctaagttataagtgccatgcatgcgtcttcc 
cctcttctagtgaatctttctgcctagattagcagttttaaagtccaatggactcgctgatt 
gttcttgtccttgtcccgccttcctcggtttgaggctgggtgtaccatttgggtttcgaaaa 

gttcaggca 
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ccacgcgtccggccgaaaacaatgggaagaggaaagttcaaaggaaagcctactggtcgtcg 
ccagttctccactcccgaggagatgattgctggtacttccgctcgacgtcctcgcacgttta 
ggcaggaagaggctgaacttgaggaagaagagagatctgaggagtctgaagaggaatctgaa 
gaagattctgatggagagaagaagaaaggtactcagggtattattgagattgagaaccccaa 
tttggtcaagccaaagaacgtgaaagctaaaaatgttgatattgagaaaacaactgagcttt 
cccgacgtgaaagggaagagatagagaaacagcaagctcatgaaaggtacatgaggctgcaa 
gaacaaggaaaaacagagcaagctaggaaagatttagaccgcttggctctcatacgacaaca 
aagagcagaagctgcaaaaaagcgagaggaggagaaagctgccaaagaacagaagaaggtgg 
aagctcgcaagtaacgaatagtaccatgaaatgttgttttcaattctcctagtacaagatat 
atccctaccattattggctaatgatggagtttacacttccacctttcgttcatgtcctgtct 
agtttaaatggagaagagttctctatagaggaaatcatgaaattatactttaagctctgatt 
ctgtacacaaaatagatttgtttggccaatatgatgggaggatttaccagtccttttgttgg 
gttgaaataaggttattgcgactaattaaactatcttgcagtgtgtgtgctatgaggagaaa 
tactttccatggaaaatgtttctaaggaaagtggttttttaaaaacttattttcctgtgttt 
gcttggtgtgtggacgcacgatctttgtatccctgaggtgctttttcaaaagattggaatat 

ataatggtttgagcaggc 

SEQIDN085 , , 

ccacgcgtccgctgctatctgatattagaagtatattggcaacgacacagatgtggccttta 

tatgttctcgttgagtaagttttcaaggacagaagaagaatactctcaaggatctggaaggg 

atcgtttaagcttggtagagtccttcctttgtaaacgatcagttggcacaaacaaatgccac 

aatacagtcctctcatgttcgtacttttgtgaagttccacatattaaccagttacattcatg 

ggactgtggtcttgcttgtgttttaatggttttgaggactctcggtaaggattatgatatgc 

aagaacttgaagagctttgctgcactacaagtatttggactgttgatctggcatatttgagg 

cagaaattttctgtcaacttttcctactttacagtcacattaggagcaaatccaagtttctg 

cgtggagacattttacaaggagcaattgtctaatgatctggtccgagttgatatgctattcc 

aaaaggcacgtgatactggtattaatatagagtgcagatcgattagcagtgaagagatttct 

tcattgatcttatctgggaaattcattgcgattgctttagttgaccagtacaagttaagtca 

ctcttggctggaagatattggtatatcagacttctgcaatgacaacccaggctatactggtc 

actatgttgtcatctgtggatatgatgctgatacagatgagtttgagattcgtgatcctgcc 

agttcaaggaagcatgtaaaggtctcctcaaggtgtttagaaggggcccgcaaatcatttgg 

aaccgatgaggatcttttactgatccgtttacagaaggaagagactgaaagcagccctttgt 

gatcgtttatttatttgtgtatgaatgattgtttctctctgactttgtccccgctgcgtatt 

gcccatatcgggtattctttagctgtatgtatattatgtacatcaagggctgtagtatcatg 

aatttcgcttccctgtatcatgaattttgtatatgatgcttggagcacc 
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ccacgcgtccgacttgattctctgctctccccttgactttcacacactcaaattcattttca 

tatccactctcaaatccagaaatgcaaatcccccatccccacccctccacccccatctccga 

cgccttcgtccacaagcgcggtaaaaaacgcggtagctacaactgcggccgatgtgggcaac 

ccaaaaaaggccacgtttgccatatctctaacgaccttaataatcatacagatgttcctact 

cccacaccgaccgatgccaaatcatttgttctcccttctccgctatccgttattcgtcctca 

gcaacttccacctccgccacgtcagccacttccccagctccggcgagcgctttctttcgatg 

acgtggatgtcagcgatgatgagtcgcctgtatctgatgatgatgacgtggattgtttggat 

ctggagagtgagttggatttaggtgggtccgggaagttacccgcgagtgctttgtgggaagt 

gcttaagagattgcctccatcagcgctgctttctgcggcgaaggtgtgtaagggttggaggg 

atgtttctagaaggatctggaagtcggctgaggagctaaggcttggagttcctgtgaaagct 

cagattgggcttgttggatcagtgttgcagaaatgccctggacttgttaagctttcacttag 

aatggaaagtgatgtggacgcaacgatgctggcttgcattgcattttcctgccctaatctgg 

attcaatggagatccttacttcagatacctcagttaatcggatcacaggggatgaattaggc 

cgttttgttgctgacagaaggtgccttaccaatctcaagatggaaggctgctcaaatcttgg 

ggcctttactctttcttcaac.cagcctttccactctttgcctttcggatctcttttgtcact 

ctaagatggtcttcaactgccccaatttaaaggagatttccctggatttttctcgccaagag 

aaggatagcactgatcttactgctatggtagatggtctt'ggaaggagctgcccaagactaca 

gaacattcatgttgcatctgttcggcttacacatgctgttgtgcttgctctaacagcagcaa 

atttaaggggattacgaatgctttccctagtactagggtcagaaataactgatgcatctgtc 

gctgctattgcatcgagctactcgaggcttgagttacttgatttgagtgggtcaagtattag 

tgacagtggcattgggatgatatgcaatatatttccagagacattgtctaaacttctccttg 

ctctttgtccaaatatcacttcaagtggcattcaatttgctgcagctcagttgcctaatcta 

gagataatggactgtggaatgaccatatgtgatccagatttagacagtccaacaactcagga 

aaatgataacggcgaattacaaagaacaccgattagtaaattacaccttatatatcagaaac 

tgattatcaaacacaaccgcttaaagaaactcagcttgtggggttgctctggcttagatgca 

ttatatctaaattgcccagagcttaatgatttgaacctgaactcctgtacaaacttgaatcc 

agaaagattgctacttcaatgccccaatctggaaagtgtgcatgcatcatgctgtcaagaca 

cattggttgaaactcttcagaatcaggtttgtggtgattttatggctggagacaatcatttt 

ccatccaaacgtcttcctgatggctcaaaggatcagagttcctcatttattcagcccccagc 

catttgatgatgagaagagaaagagaaggatttcaaagcgacggtgcgcggtgcttgtttat 

tagtcaaatacttgtcttgtattggctttgttgtactctagaccaattgtccattatttgtt 

atatagtgatctgaggctaaggcctgatcatgtaattttcattgattaaactatactcaacg 

tcaatacagggattgtatttcctctatcaataaaaagtacagcagcc 
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ccacgcgtccgtctcaatccaaacttcgagttcacatttgccctagctttgagaaatgatca 

tttgcaaactcaaattattgaattagatcataatagaatccgtaatatacccaaattgtatt 

gttttgttttgatgtgtgttgtgaaaatactgtaaattttgaacaattcgattatggccgat 

ggtaaggtggttaagcgtgtcaagtacaaatcttcagtcaaggaccctggcgtttctggcgt 

tttgaagttgaccaaggaacggtttttctttatgccgaatgacccaacatcaacgacaaagc 

ttaatgtggagttcaagttgattaaaggccacaggtcttctaaagagggttcaagtaagcag 

gctcttcttaatctcatgcacgatcagggcaggaattatatttttgagtttgatagcttccc 

ggaccgcgacaagtgtcgagaatttgttgcctctgcaattgcggtttgtggagaagttgtga 

aagctgcttctgaaaaacctgctgttccacatgatgaacaactcagtgcagcagaaatggga 

cgtcggattaagttactgcaggagaatagtgaattgcagaaactccacaggcaattggtcat 

tggaggtattctatcagaggctgaattttgggccgctaggaagaagctactggaacagggcg 

atatcaagaagccaaaacaacgggtggctttaaaaaacgacatgtggagtgtaaaaccttta 

tccgatggccagacgaacagagttacatttaacttgacaccggaggttattcatcagatttt 

tgctgagaaaccagctgtccgccaagcatatttgaaatttgttccgggcaagatgtcagaaa 

aagaattctggactaaatattcaagagctgaatacctccacagcacaaaaaatattgttgca 

gcagctgctgaggctgctgaagatgaggagcttgcggttttcttgaagcaagatgacatgtt 

agcatttgaagctcgtaagaagatcagaagggtggatccaactctggacatggaagcagatg 

aaggtgatgattacatgcatctcccggatcatgggctacctcttgatgaaactaaggagatt 

ctggaaccacagtatgaaccattcaagaggtcgttctcgcagtacctcaaccagcatgcagc 

agtagttcttcgaggaagagttatagatgttgagctgggtgacacaagatctgttgctgaag 

cattcatcaggacaaatcaggctgaactagctgccgaagtgtctgatgagagtgcatataga 

gaacgcatagctaaagtttctcgagttgctgaaattgaggatcttcagggacctcatgagcc 

accagttgcattgctaagtatcaaggatcctcgggattactttgattctcagcaagcaaatg 

caataaaggctttgggggatgctggtacagggacaagacagctgaaatttagtgtgagcaaa 

gaagaagccttttgct'ccttgaagaactccatcttcgagataaattcacaaggattgatcga 

accaataattagtccagaagtagctctcaaggttctcaacgggcttagtcagaatatctcga 

gtacaaagtatcatctgggaaagaacccccatgagagtgttttagataggctgcctagtgca 

acgaaagatgaactattactccattggacatcaattcaggaattattgaagcacttctggtc 

atcttatccaataacggcaaaatatttctacaccaaggtgactagattaaaggatgcaatgt 

ctcagatataccccaagttgcaggagatcaaggaatctgtgcaatcggatttcagacatcaa 

gtttcccttcttgtacagccaatgcttcaggctttagatgctgcctttgcccattatgatgc 

agatatacagaagagatctgccaaaagtggggagagaccaaatggatttgcttaggcaaaat 

ttttctccattttcatccgatatttaagctctttgttttctgggggttatatacacgaatgt 

acatttaacaaaattttgttcgagtgtgttatagcatattctatatcttgacagttctaatt 

gactgcctgcggtaattgtacatctagtggaataatggttg 
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ccggtccggaattcccgggatatggtggacccacgcgtccgctctttggatgttatggaagg 
atcaaagtggaatatgactcgaagtggcttttgtggaatgcggtctaaaaagtttgatggct 
teat cgatttggatggatatgacacgatagcgcttaaacttaaaggagatggaagatgt tat 
atttctactatatacacagagaattgggtcaatactcctggaca'agatgaagataattcatg 
gcaagcattcgtttttgtgccaaaagaaaactggtatattgcaaaaatcccgcttactcgtt 
atgtacctacttggagagggaacatgataaatgcaaagttggagatgaatccagctcgaatt 
cttggcatgtctctatctgtcaacgcagaaggtggagttccaggtgccaagtctgggcctgg 
tgatttccaagtggaagttgattggattaaagccttgcggatgcagtaagcaaaaggggaac 
atcttaaagaattattagaataggctgggacatttggggcatccacgctcaccagttgagca 
agattgtggaaatgccattcagagatggagaagatacaggttctttttctatgtaccttgga 
ggaaaagagagaattgagctgaggaaaggagtgaaaccttaaaatgcagtgactacaggcca 
caccaccaagtcaaattatcagatttttttcttgtaataaatggggctcttcaatttttctt 
taggctatcaactagtatggtaaactaagcagtatgtttaataattatatcctcgtctgtta 
caaggtttggcaatcaaataatacaacaatgtgcttggaatcggtagtactgttaaaagatt 
taatgtcaatgtgcaatgcgc 

SEQIDN08 9 

ccacgcgtccgacctcttcacctttacaaacttctacaaatatatttacttcaaacaactga 
gtagtcctattgtttctgattcgatgtcggttagaacagtgaaagtgagcaatgtctctctt 
ggtgcgtcggagcaagatatcaaggagttcttctcattctctggggatattgagtatgttga 
gatgataagtgagaatgagcgatctcaaattgcatatgtcacattcaaggatccccagggtg 
cagaaactgcagttcttctttctggagccacaattgttgatcagtctgtcatagtagccctg 
gaacctgactacgagctgcctcctacagctccagtgccaatcaaggcaactgagagggctaa 
tgcagctggtggtggatctgctattcaaaaggcagaagatgttgtgagcagcatgttggcaa 
agggcttcatcttgggcaaggatgcagttaacaaagcaaagacatttgatgagaaacaccag 
ttcatatccactgcatcagccaaagttgcttcactagatcaaaaaattggacttag-tgagaa 
aatcaatatgggaacaactattgtgaatgacaaagtgaaagaaatggaccagaagttccaag 
ttactgaaaagacaaaatcagcttttgcagctgctgagcagacagttagcactgctggatca 
gccatcatgaagaacagatatgttttgacaggggcatcttgggttactggtgctttcaataa 
ggtcaccaaggctgcaggggaagtgggccagaagacgaaggaaaagatggcagaagaagaac 
agggaagaagttcagctgcaggttacgtgcctatacatgctttctcggagtccccaaaagct 
tccaaaaccgaggaacctgccaagccctcttcacctaagggcctaattctctagcttgtgca 
aaaatatttcaaaactattgttcaattccgcttgtctgatcttttagctgtcattgtgttgt 
ggttagacttagatatgctagttatacataaaatgtcctgtacgattgttgatacatggaac 
gatagttgctggactattaaattccctgtcggagtgctgtgcgg 
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ccacgcgtccggagagtaaaagtggatcctatgaggaagagcgtcagtttgaacaatctctc 
acagtacgaacagccaaatgctaacaacagcgctgatacatctaaagtggctgaggaaggat 
atgcctctgcagatgacgctgttcaacaccactccaacagcggtcgcgagcgtaagcgagga 
gtaccatggacggaggaagagcacaagttattcctattaggattgcagaaagtggggaaagg 
agactggagaggaatctctagaaacttcgtaaagacacgtacaccgacacaggttgcaagtc 
atgctcagaagtacttcctccgacgaagcaacctcaaccgtcgtcgccgccgatctagcctc 
tttgatatcaccactgactcggtatcagctatgccaatagaagagggaaaaaataagcaaga 
aatcccagttccaccagttgtagcatcatcaccaacattgcctactactatagaggctacca 
aaaccaatgcatttccagtggcacctatcatgttaccagcacagattgatcagtcaagagaa 
agtccaactctgttgcaacgaaatcaagtgaattcgtatacgccagttcgccctcttcctat 
gctttcaatgcccaatccatcaacagtatttgaccttaacgtgaaccagatctcagaagtcg 
aaccattgtcactgagattatccttgtcacttgatcagggacaagcatcatctactagacac 
cactcggcatttaaagtaatgccaagcttcagtaatggagagagcatcattagtgtggcatg 
agatcgaaggatctgtgagaaaaaaatgaaagcaatatggaaagtaaaaataggacaagagt 
gggtacgctgcactcataattatattaagggaatgtttatttaaggagagattaattgacta 
gacatttggtcctgatttgtacagaccagaaatatgtcatgccttgtggttacctgtttaat 

gcaacgagtatactgac 
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ccacgcgtccgactttttccactgagctccactccaatgtgtaaaaccctagctaaaaatct 
ttaaagttagggtttcaaatttgcaatggggaactgctgcagatctccggcagctgtcgcaa 
gagaagacgtgaagtcttcaaacttctccggcaacgatcacggccggaaagacaagtccagc 
gccggaaaatcgcaaaaaccggtaaccgtgttaaccgatgtgaaaaattcgaacgttgaaga 
gaagtatttagttgatagagaactaggcaggggcgaattcggaattacatacctttgtatag 
atcgtaacagtaaagagcttttagcttgcaagtcaatttcaaaacggaagctacgaacagct 
gtagatgtggaagacgtgaggagagaagtagccataatgaagcatttgccggtgaattcaag 
tattgtgagctttagagaagcttgtgaggatgaaaatgcggtgcatttggttatggaattgt 
gcgaaggtggggaattgttcgataggattgtggcgcgaggacattatactgaacgagctgct 
gctgctgttacacggacgattgtggaggttgtgatgctttgtcataagcatggtgtgattca 
tcgagatttgaaacctgagaactttttgtatgctaataagaaggaaaattcgcctcttaaag 
ctattgattttggcttgtcaattttcttcaagccaggtgagaggttctctgaaatagtcgga 
agtccctattatatggctcctgaggt get caaacgaaact at ggaccagaaatagatatatg 
gagtgcaggagtcattttatatattttgttatgtggggttcctcctttttgggccgaatctg 
aacaaggtgttgctcaggccatcttacgtggggtgatagatttcaaacgggaaccctggcca 
agtatttcagagagtgctaaaaatcttgtacggcaaatgctggaaccagatccaaagcttcg 
actgactgcaaaacaagtacttgaacactcttggcttcaaaatgctaagaaggctccaaatg 
ttccccttggagatgttgtgaagtcaagacttaagcaattttctttgatgaataggtttaag 
aggaaagctctgagggtgattgctgatttcttgtctaatgaagaagttgaagacctcagaga 
aatgtttagcaagatagacaccgataatgatggaattgtttcagtccaagaactaaaagctg 
gacttccaaagctcaactcacagctggcagaatctgaagtacaaatgcttgttgaagccatt 
gacaccaatggcaaagggaccctggactatggagaatttattgctgtttcactccatcttca 
aaggatggctaacgatgaacatctgcacaaggctttctcctactttgataaggatggaaacg 
gttacattgaaccagatgagcttcgagatgccttgatggaggatggagcagaaaactgcgcc 
aatgtggcgaatgacattttccaggaggttgatacagacaaggatgggcgcatcagctttga 
agaatttgcggccatgatgaaaactgggacagattggagaaaggcttcacgacattatt:caa 
gagggagatttaatagtctaagtgtgaagctaatgaaggatggatcgcttaacttgggaaat 
gagtaaggtttacattttttcatcaaaatgaagtattgtatcgatgtgtatttgatctcgat 
gtgtatttgatctctcgccattgttttctggggtgcccattagattgtttgcttgccaggat 
ggaaaaggggcgacttcatctgggtaaccgttgtaaccatttgaaacacagaatgtatcctt 
ctactccc 
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ccacgcgtccgcttggacttggttacaaaaacagtagccttaaatagtccagctactgccct 

tgccttgtacatgcaatgcaagtacaggctgccgccatggcgaatttctcaccgttgctatt 

aacgacgatttggctagtgcttgtaatttgtaaaggagtagagagtggtcattcctcagctg 

ttggagatccaggaatgataacagatggcttaaggatagctttagaagcttggaacttttgt 

aatgaagttggtgaagaagctcctggaatgggtagccctagagctgctgattgctttcatct 

ttctgacagttctctgactcacaaggtaaccgagtcggataataagctaggagttggcaaga 

cattccctggcctgagtcctaaggctaagaataatccggacttatatgctgttgaaaaggaa 

ctctatcttggttcattgtgtgaagttgatgacacgccgaggccatggcaattttggatgat 

aatgttgaagaacggaaatt'atgacacaaaatctggtctttgcccagaaaatgggaaaaaag 

tgcccccttttaatcctggaagatttccttgttttgggaaaggatgtatgaatcaacctatc 

ttgtatcaccaqcccactLcattattagccgatgatattatgcggggaggttttaatggtac 

ctatgatttgggttcttcaacgggtggcagtagttccttctttgaggtgctctgggaaaaga 

aagttggcacagggggttgggtatttcagcacaaactcagaacctccaaattgtatccatgg 

ctgatgttgtatcttagggcggacgcgaccaaagggttctctggaggctaccactacgatac 

cagaggaat'gttaaaaactctcccggagtcacctaattttaaggtcaaattgaccttggatg 

tgaagcgagggggaggaccgaagagccagttttacttgatagatattggcagctgctggaag 

aacaatggtgctccatgtgatggagatgtgctcactgatattaccagatacagcgagatgat 

cattaatccagaaactccagcttggtgcagccGcacaaatattggcaactgcccaccttttc 

acatcacaccgaacaatactaaaatctacaggaatgacacctctcacttcccttactcagct 

tatcactattattgtgctcctgggaacgccgagcacttggaaaagccatatagtacatgtga 

tccttacagtaatccccaagcacaggagctagttcagttgctgcctcatccaatatgggcag 

actacggctatccaaccaaacaaggagacggctgggttggggatggaagaacatgggagctt 

gacgttggtgccctttccagcagactttacttctatcaggatccaggtacacctcctgctag 

aagaatatggacatctctggatgtggggactgaaatttttgttagcaacaaagatgaagtgg 

cagaatggactctgagcgactttgatgttttaatcacctcgtaaagccataataatgatacc 

cttctatttaacattgtaactgtagccaaagcaaaatcagatagtgggacaaggtctcatca 

ttcttgatgtctaaactttatctttctatactagatctgatctgacggggcaagtcctggca 

gctttatttccgagagaagaaaaaagaattttgtttttgctttaaaaaaaaaaaaaaaaaag 

ggcggccgctctagagtat 
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ccacgcgtccggcttcactatcttgagctggccatattttcatgcttgcaggaactgattta. 

ctgctttcgctgggtggcgtttttcaagtttaaagtttaggtcaaactcagaggattcgtca 

tagtttacatacttttctgctggtcattaacgaaactatgtgtactgaaacacgggggtgat 

tcaattcttctttgtcacagatattaaagctggactaaagcatctacagactggactctatc 

ctgtagttagatatgcagactaagaaaaaattaaatggaagaaatccccgagagctggctag 

tccaaaggtttcaagacagcagcggaagatgtccgagaatgtgcaaactcaggcaaagcaag 

ttaaggaacttataacatctacagtgaggaagcaaaaatcaggaagcaatttcttgaaaaag 

attgagaattatgttgctgctacagatctggatgtaagatttggattggtgtctgatgacac 

ttctgctgcttcagacgcacatgatgctgttcatgaatataatactattactattaataagg 

actataatgttgaaactgatagttgcacaaatgatactatattttctcctaccttccatata 

tccagaactattggaggggaaatttctaacagagcagacatacccaaattcattgagcaagc 

agaccagccattgcaggagcctggaaaggaaaatatggaagttgatctgctgacaagtcatt 

ttgtgctggacgaggctaccgatatagggggccagcatatctcctctgaagtttcagctgtg 

catctctctattaaagattcaaaactggaatgcattgatgaatttaatcaatttcagttgcc 

tgctgatgttagtatggaggaagaggaaactgaagagtttgatgactttgatccatattttt 

tcataaagaatttaccagacttgtactcagttgttccaacatttcggcctgtgctattgcct 

aaacaaacacggagttgcccatcaaccactcttgttttggacttggatgagaccttggtgca 

ctctacacttgaaccttgtgatgatgcagatttcactttctcggtgaatttcaacctgaaag 

atcataatgtatatgttcgatgccgtcctcatcttcgggattttatggatagagitatccagc 

ctatttgagattatcatatttactgcaagccaaagcatttatgctgagaagcttctgaatgt 

gcttgatccaaagagaaaagtatttaggcatcgtgtttaccgtgagtcatgtgtatttgttg 

atggcaattaccttaaagatctgtcagttcttggccgtgatttagcacatgtgattatcatc 

gacaactctccgcaggcatttggattccaggtggacaatggtattccaattgagagctggtt 

tgatgaccgctctgacaaagagttgctctctttgctcccatttctggaaagcttagttggag 

ttgaagacgttcgaccgattattgctagcaaattcaaccttcgcgagagaatagctgctgct 

gctacttgtccttttaactctattagaggtgatgcatttgagagatagggatccgtgtcttt 

atagattcagtcttggttacttgaattttagatttcaatggctctcgatgagttgcaggaat 

cagttctaatgtacctttgcggatgtgagtttgctagaggctgatct-ctaatgttggttaat 

ttatgtaattcacatttatgtaatggtgccataacgacgcttgagattggaggaaacttctc 

aataaggctgtatctgaaacgtgaaatcatccaagcgag 
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ccacgcgtccgcttagggttccaaattgctctaaattcccgcggattgagagttcattggag 
acttccattgttcccagcggctaagatgagccggttgattgagcatcacctagcaaataata 
aacaggacatgaaagggacagaggtttttgttggtggtttggcccgtactactactgaaagc 
aaaattcatgaggtattttcttcatgtggtgagattgtggaaat'acggttgataaaagacca 
gacaggcgttcctaaggggttttgctttgtacgatttgcaacaaaatatgctgctgacaaag 
ctctgaaggaaaaatctggatatgtgctggatgggaagaaactcggggttcgcccctcagtt 
gagcaggacactttatttcttggaaatcttaacaaaggttggggtgcggaggaatttgagag 
tattgtgcgccaggtttttccagatgttgtatctgttgatcttgcacttcttggagatgtcc 
aacctggtcagaagcaacggaatcggggttttgctttcgtgaaattcccatctcatgctgct 
gcggctcgtgcttttcgggtaggctcccaatctgattttctcattgatggcaagttacatcc 
atctgtacagtgggctgaggaacctgatcccaatgaacttgctcagatcaaagcagccttcg 
ttagaaatgtacctcctggtgctgatgaagattacttgaagaagctctttcagccctttggc 
aatgtagagaggatagctctatccaggaaaggtagctccaccattggattcgtttacttcga 
taagcgatctgatcttgacaatgctattatggcgttgaatgagaaaactgtacaagggccaa 
tgggaggtccctcatgcaagcttcaggtcgaagttgctaggccaatggacaagaacaggaaa 
cgaggtcgtgaggatccaaacatgtccagtaccattgagagtcattccaagcttttgaagga 
tgatccagatgttgagatgattagggctcctaaatcaactgctcaactggagatggattatt 
cggatccttatgaagctgctgtagttgcattacctgtggttgtcaaggagcgtttagttcgg 
atcttgcggcttggtattgctactagatatgatatagatgttgaaagtttaaccagtcttaa 
gatattgccccagtcagctgccatatctattcttgaccagttcatgttgtctggagctgata 
tgcagaacaagggaggatatctagcttcattaatttctaagcaggttgaaaaactgggaccg 
aaacaattcgatagtaggtcaaggatagaagatgttggcttgagggtgccagaaccagacag 
gttctctacaagagttcgtttgccagatctagattcatatgcctcacgagtacccttgccca 
tgcctaggactgatgtttacacatctcactattcagcgtatttagatccccatctgtctggt 
cggatgacagcaaagaggatggaggaagcaagttcccatttgcaggcgacttcacttctgtc 
tagtcgggtggcaacgaggatggaggaggcaggttccactttgcagtcgctcctatctggtg 
gggtgacgacaagaaggatggaggaagcaagtccgattttgcaggcaacactccttccatct 
ggtcgggtatcaaggatggatgaagcaagtcccaatttgcaggcaacatggagcccttctcc 
tactaatgacagaattggacttcattcacacattaccgcaactgctgatcatcaacatactc 
gaccacggatcaggtttgatcccttcactggtgagccatacaaatttgaccccttcactggc 
gagccaattgttcccaagagctcaagtcatcatcgaagcctgtactgaacgttctgagcatt 
ctaatttacaaatggcttattgccaaacctatgtaacataatgatgcgtatttttgttcatc 
cgcagctgtaaaatagtagctgttagcaggattatttggttatgtttctcattgacttcatt 
gattgcgaaggtgcatttggaatctcggcaatcacaatttatagccggtgca 
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ccacgcgtccgcccacgcgtccgcaaaccctcccgcgaagctaaatttcccctttttctctc 
tctctttgattcgaataagagaaattgggggtttacagtaattggggttttcgtatatttag 
ttctgtaaactcatccctcgactcgattcttcttttgatttgcattgattatcattagattt 
gactttgattttcaattcaattctaattgatggaggatactaatcagcagaacgtcgatcga 
ttcacgtctgtttcatcttcaggtgaaagagccgttgagccacataatgctgcagaacagcc 
tatttcgcccaaagatgaaaggactgtttctgcaaatgcttctgtgaatgcaatcatccctg 
gggctttaagaaatgctaaagatcaccctgttacctctgaaactggagctctgtccgccttt 
tatcctctcaattcctattctcctcaggaccaaggtttttactatggaggttacgacaacgg 
cactgggagttgggccgaacaatccaatgatgtcaatgtgaacttgcatgtagttccgccag 
caatgtacaatgagaatcccctctttttccctccgggttacggctttgatgctcagatggca 
tttggacagttctcccccatcgctagtcctctttctccttttatgatagatggccagctata 
ctccccacaccagattccggtttctccaaattactatgcaccacctatttcccctggcttgc 
cgcatgttacatcagcrcttccagcttcgcagcctgatctggtggcaccaggaagcactggc 
catgaaattgatagcatgtattttgggccaggatcaggttactacatacccgttggatcgtt 
tggcggaggcgagctttcgggaagcagcaacattggtttctacaattaccaaggtgaatttg 
gatctggtcaatctttacctaatcgacctaaccccctggactctggaagatacatgtctcaa 
atgacatctgcggcactatatccacaaccagttggcatacttgggtcgtacgaactaaacgc 
tgcaggcttcacatcaaggtcttggattcacaccaggctcctcaggcaggaattattccc 
ggcaatccttatcctagtgcaaactatggtactgggtctagttctctgtgggaaccaggt 
_cagaaattggctaactcctgacagaggtggaagacgtgagagggatcggcactctgttaa 
catttctactgaatcacttggtatggcaagtgaacgaaaccgaggaccaagggcattaaaac 
aaagagcaagggtcccattgaggatagctcttcatctgtcatccgtaaagaagttgagtcg 
ctaatactttgcagcctgagcagtataatcggcctgaatttgtcactggttatgaacatgc 
aagttctttgtcatcaaatccttcagtgaagataatgttcacaaaagcatcaaatatagtg 
tgtgggctagcactcctctgggaaatggaaagcttgatgctgcttatcgtgaagcaaaagag 
aggaatgctgattgtcctgtttttctctttttctcggtgaatgctagtggacaattttgtgg 
ggttgctgagatggttgggcctgttgattttgagaacaatgcggagcactggcagcaggatc 
gatggagtgggcaatttcctgttaaatggcatgtcattaaggacgtgcctaacagtcagttc 
cgccacctacttctggaacataatgacaacaaaccagttactcacagtcgagattctcaaga 
ggtgaaattgtcagagggactagaaatgttgaaaattttcaaaaactatgaagcggatacct 
ctatattggatgatttcacctattacgatgagagggagaagtccttgctggaaaagaagagt 
aaacagcgaacacttcaacctggtagtgctgcagttactactgcagctgacacaataagtca 
actagcggatagtcttgccggcacattaaacttggaaggcaacaagaaattgccttaaaaag 
agtttgtaatgcttaagcctgtagcagattccagaggcaatatcaactgctgtcattcaatg 
ttagttggttgggcaaccagctggctcgtatagttaaggagattgctaacagcattttggga 
ggtggccttttcatcacctttggatcagaaaactctctttatctctttcctttatttgtcat 
ttgttcaatgagttggttgtgatttgcaattcttgggcaaggagaggcagccagtaatgtta 

tgagctatcgtttaccc 
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ccacgcgtccggggttggcaagaaaaattctaaagagaaaaggaggaacacattgattttac 

gaggactagtcattcatttttcttggacagtcttggaaactaacagcttgattgctgaccct 

ttctcctatcacggctagacagccaaactccacattcttatataaagaccacccttttcatt 

ttggatttggtaaaacaaaggaagtccagaagataatcagagaaagatgaaatttgggaaag 

aatttgcatcccaaattgtccaagaatggcaagaagcctatgtggattacaattatctcaag 

agtgttttaaaagacatcttgaatttcaatattgccgcttcacctgaagttgaaggctcctt 

aaagagaaggctatctatgtacagagcctttagtggattacaaactagtttcaaagtttctc 

aaaacaatgaagatgaagccatattagtgagttcagaaggccactatcaaactatgtttctt 

atgtcatctgaaaaaggtggagaaaatgagatggttttctttaaaagacttgatgatgaatt 

caacaaggtgataactttttaccagaaaaaagtagaggaagtgaaggctgaggctgatgagt 

tgagtaaacaaatggatgcacttattgctctaagaatcaaggttgataagccttctataaga 

atcaaaaattcccatttgggaaatccaggtaggtcagaaatggaggcaatacaagaagcaga 

gatgacaagtgaagaagaagaagcaacaagagggaaaagagatacagcaaatacaaaacata 

tggaatttaggcctgctccactagagattttggaccatataaaaatcaatcttgaacccgaa 

acacctgtctcaactttaaaaaatatcatccatacttcaaaatccaacttatcattcagcaa 

agaggagctcagaaaagctgaagaacaaatgagaaaggcttttgttgagttctatcaaaagc 

ttcgacttctgaaaaacttctgtctcttaaatgtgttggcattttccaagatcatgaagaag 

tatgacaagatcacctcaaggaaagcttctaaatcatacttagagatggttaataaatctta 

tcttggtagctctgatgaggttgctaagctcatagaaagagtggaggccacattcataaagc 

attttgtcaatggaaatcgaaggaaaggaatgaaatctttaagaccacaagctaaaagagaa 

acgcatagagtaacatttttcctgggtttgttctctggcggctcaatagcattagtggcagc 

tattgctgtatccatacatttcggaaaccttctacagcatgagggtcgtgggcagtatatgg 

aaaatatatttccactctacagcctattcggatacattgtcctccatatgctcatgtacgcc 

gggaacatatactactggaggcattttagagtcaattatcccttcatttttggcttcaagca 

gggaacagaactaggttacagacaagttcttttccttgcttctggtctttcagtacttgcat 

tggctgctgcattgtcccacctagatatggagatggatccaaatacaagaagttttgagaca 

gtgattgagctgatcccacttgccgtggtgtttattctgcttctaataactttttgccctct 

gaacatcatatatcgttcaagtcgcttcttccttataagafgtggttggcactgtctatgtg 

ctcccctttacaaggttaatctaccagatttttttcttggcagatcagcttactagccaggt 

tcaggcaattaggagtttgcagttctatgtctgctactatgtgtggggcaacttcagaacaa 

gatctaataaatgtcaagaaagcagtgtttatcaaatcttatacatagtcgtcgcaattatt 

cccttttggtctcggtttattcagtgccttcgccgcttatttgaagagaaagattcaatgca 

ggggcttaatagcctcaaatatttctcaaccattgttgctcttgtgatgaggacactttatg 

ctcaaaagagaggaacgttttggagagtaatggcggcatcatcctcaggaattactacagtt 

gcaaatacttactgggacattgttatagattggggtttattgcaaaagaattcaagaaaccg 

ttggttgagagacaaactgcttgtgccacacaagattgtctactttgttgccattgttcttg 

acattattctgagactagtatggatgcagttggttcttgattttcaagaactaccatttctg 

cacaagaaagcaatggttgcagtagttgcctctctagagatccttcgccgaggcatgtggaa 

ttttttcaggttggaaaatgagcacttgaataacgtcgggaaatatcgtgccttcaagtccg 

taccattgccttttaactacgaggaggacaagagtctatatctatacctctagctgatacgc 

agaagtcgaaggaatccagggttttcttttctttctttttttttcttgcacaaattcttctg 

attcgttgccgtatattggt 
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ccacgcgtccgaaaccataaacagagcagagagcgattgagagagagagagagaaatggaga 
ctgtaaaaaagagtgcatcggcaatggaagcattcgagaagcttgagaaggtaggggaaggt 
acttacggaaaggtgtacagagcgagagatagggttactggcaaaatcgtagcactgaagaa 
gacgaggcttcacgaggacgaagaaggtgttcctcccactactctccgcgagatctctcttc 
tgcggatgctctctagggatcctcacattgtcaaactgatggatgttaaacaaggccagaac 
aaagaaggaaagacggttctctacttggtctttgagtacatggatactgatgtcaagaaatt 
tattcgtagtttccgcgcaaatggagaaaacattccccctaaaactgtcaagagcttgatgt 
accaactatgcaaaggagttgctttctgccatggtcatggcgtgttacacagggatctgaaa 
ccacacaatcttctgatggaccgtaagacgaatgtgctcaaattagcagattttggacttgg 
cagagcttatactctgcccatcaagaagtacacgcatgagatattaaccctatggtatagag 
cccctgaggttcttcttggagctactcattactccacagcagttgacatgtggtctgttggt 
tgtatctttgctgaactggtcacaaaacaagccctcttcccaggagactctgagctgcaaca 
actgcttcacattttcagattgctaggtactcctaatgaagaactctggcccggggtgagca 
agctagtaaactggcatgaatacccccaatggaacccccagccactctcaactgctgtccct 
ggtctagatgaagatgggctccaccttctaactgagatgttgcattatgagccagctaagag 
gatttcagcaaagaaagctatggaacatccctatttcgatgatttggacaaaactcctctct 
gaagtcccgctcatgacccatctgttgaaaaattgcaaatttctcatcaccggagatcaaca 
aacccatctaacccctcatcgcaagcttttattgcttttctcaagcatcttttaatagtatc 
aattagtatgactagcttcacctaaaaactttgtctttctatatcaattggatcagtgtagc 
acaattatgtggaatgatagaaccgca 

SEQIDN098 

ccacgcgtccgatttcctcggctattttctgcactgactcacgatttttcggacgctttgtt 
ctcgccgtagcgcggatattatacactttgtacaatctctgtagtgatcgccgattgatttg 
ccgctccggtgaagttgtccttgccgaaaattttctctcagatcttgtgaggcaggtggctg 
cagttgttgtaaaggttgaagtagctctagacaaaagcatttgcatgttgaccagatgagca 
gaactgatgttatttgcagtagaaggaggaggtttcttctcgtcttcagcttctggatatag 
taagggcctgacccttctactcttgggtcagaagaacgaagagaagcccatgagagttgcac 
cgtggaaccagtaccagttggtggaccaagaaactgatccggacctccagctggcttccggg 
aagaacagggttgtccgcgggtgcgcctcctttgtatgctttggtcgcgctgccgctggact 
tgagagcccatctccccttaaagtcggtcctacccaacagccagaagtcttgcctagctgtc 
ctgcttctgacaaggacaacaatcagtcgcagtgt.gttaatattattgaagacagtcatatc 
tcaccaaaggttgctcttcggagtagcttaaagaaaccagcaaatagtatacccatttctgg 
tggtaatggtaatgaacgcggcacaaattctctaaagattgatgatgcccccaatcctatgg 
agaaaaggaaagtgcagtggacagacacatctggaggagagctttttgagataagggaattt 
gagcctagtgatgatggtgaatcagatgatgaatttgagagtgggaatgaaagaacttgttc 
ttgcaagataatgtaattttagctccttatagaaggtttcatggctgattttcatgtaggag 
gcaacaattgaggtgctgcagatgatattacgtgggaggttggtttcgacacatgccttttg 
ctcaactatcagtcgacacaagtgaattttgaggctttattaggtagcgtcacgttgatcat 
gggcttttctggttttggctgactaatctgttgtctatattcaaattcttgaatgtcagttt 
tgtttctctggggcccgctcgtctattgttttcataattatattttatcttcatttttttaa 
ttaaaagatttctggtcctcttttatccaaaaaaaaaaaaaaggacgaccgctgtagggt 
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ccacgcgtccgagcatatattcttcttcttcttccttttggcttctatttcattagcttata 

aaaaccaaaacacaaacccacaaacaaagaaacgttaattagctttgtggagtggagagacc 

ttttattgaaagggagcttaactgttggaccagctgagcactacttgccaaagatttgaact 

ttcttggttttttggactggaagttagataaagctcaaatcttttgggtttattttgtatct 

ggtacagtttcttgaacaagaatagcaggaccttcaaggctgaaaagggaatattttcggct 

tttttggagtgatttgttggatgatcaggaaattgttttgtatttgagggaagcagttaata 

aggtagaatagaaatgatggatagggtgccaagattgcttatggaggttctaacagaaccgc 

aacgaggaggagagtctttactcgggtcgattaagattgctgttttaccgattgcaaaagtt 

tttaccatgtgcttcttgggatttcttatggcctccaagtatgttaatattcttccagctaa 

tggacggaagctcttaaatgggttggtgttttcacttttactgccctgcttgatattctctc 

aacttggacaagccatcacatatgagaaactgcttcagtggtggttcatccctgttaatatt 

gttatcgccaccatatctggctctattataggttgcatcgttgcttcaatcgtccgtccacc 

atacccttatttcaagtttaccgttgtacaaataggcattgggaatattggaaatgttccac 

ttgttctgatagctgcactatgtcgggataaatcaaatccttttggagactacgagatatgt 

tcgcgagatggaaatgcatacatctcatttggccagtgggttggtgcaatcgttctctacac 

ctatgtgttcaatatgctcaaacctcctagtgaaggcactttcgacgttcaagatgcaaatc 

ttcctatcaaaagtcctaacaaagatggctcgcagagccatctggtagctagttcaccagag 

caagttccattacttacaacagacgtagcaccggctgactcaagtggttcaaagaaagaaaa 

ggttaaagagttctttaattttctatatgagaaactgaagctcaagcaaattattcaacccc 

ctattatagcttctgtcctagccattgtcataggatgtgtgccaatcctgagacgacaggtc 

tttacttctgatgctccactttacttcttcactgacagctgtttgattcttggggatgccat 

gattccctgcatattgctggctttaggaggcaatcttgtcgatggaccaggacccggaagtt 

caaaacttggcctaaggacaaccgctgcaattgtctttggacggctggttttggttcctcca 

accggacttggcattgtcatgttagctgataagcttggattccttcccgctggtgataaaat 

gttcagattcgtactcctccttcagcatacgatgcccacatccgtactttctggtgctgttg 

ccaacttgagaggatgtgggaaggaggcagcggcggtattgttttgggttcatatttttgct 

attttctctatggctggatggatcatcctctacctcaacatactcttttaagttaggatcaa 

acagtgttgctacaaagagtaaaaagaagagatcttgggatggaaggtttttattcctgtta 

ccaggatcgcgccagcctttcgtaaagctgctgtttttagctcattcaattgcctcattgcc 

atttgagactaagagagagatgtattaatattatgtaggaatattacctactacatctataa 

gtataattagtcatgatggagttaaccaattgctccttatttgttcttggcttcttctactg 

tataaccttagcttatgctaccttgaaactggctatgtcaaagttggacttggcatttggca 

gacaaagatgagacatgatgttcattggaagaataagtaaacgttgaacagc 
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ccacgcgtccgagcttaagaaaagaacacttgccctgctgagtatctataaccatatagata 
ccagtaccataattttgcatcatcttcaactctaagggatcaaagctttattgtccaaagaa 
aaaacttatggactcccctcataagccaaggtctttttcacctaacctattctttttcttcc 
ttcttgtatcctcaaatctcctgacttttttcatttctaacatatttaagaactcttcttgt 
tctctataccagcaaacatataaagccattgccactgcttcattaaacaatgctatcccttt 
tgttgttaagtcagaaactagagatgttgttcatgtatctgataaaccagcagctgatttag 
acttaccatctgagttccttgctttcacatctccacatcaactgccatttggagtcaactcg 
agctttaattctgacaaactcatccctcccgttggccgtccatgtactatgtttccagattt 
acttcggcgttacatgtcatacaaaatcaatggatcttgtcctgatgatgagctcctggcac 
agaagctgcttctcaaaggttgtgagcctctccctcgccgtagatgccgtcctgcagctcaa 
caggaatatgttgagccttatcctcttcctgagagcttgtggattactccatcagattcatc 
tgtagtttggacagcatat-acatgcaaaagctatgaatgtctaatcaacaggaaaaaaaatc 
aaaaagcatttgatgattgcaaagactgctttgatctcaatggcagagagaaaaatcgttgg 
ttgtccaaaaagggagctggccttgacttctctattgatgaagtactagcagtgaagaagcc 
gggtacaatcagaatagggcttgacattggtggaggcgtagctacatttgctgtaagaatga 
gagaaaggaacataacaatattaacaacttcaatgaatctcaatggtcctttcaatacattt 
atagcatcaagaggagtcgtacctttgtacataagcatttcgcaacgacttcctttctttga 
caatacactggatatagttcactcaatgcatgtgctg-agtaattggatacccacaactctgc 
tccacttcttattattcgacatctatagggtgcttcggcctggtgggctgttctggcttgac 
cattt'tttctgtgttggtgagcaatttgagcaagtctatgctcctcttattgacagcattgg 
gtttaataaggtcaagtgggtcgttgggcgaaagatggacagaggccccgagctgaatgaga 
tgtatctttcagcactgttggagaagccactgaagaactcttggtgatagtattagatgttt 
cttttactttcttacttttgatagttatagaagagaagatagaaggaggtgtttattttttt 
taaaattatagattcatttcagatgacatcttctggcataaactagcagtttgaggtagctt 
gagtatgattttgtaatgttggtgggctaaaccttagagctttagcggcc 

SEQIDNO101 

ccacgcgtccgtgatgctgttcatgtatctgataaaccagcagctgatttagacttaccatc 
tgagttccttgctttcacatctccacaacaactgccatttggagtcaacccgaactttaatt 
ctgacaaactcatccctcctgttggccgtccatgtactatgtttccagatttacttcgtcgt 
tacatgtcgtacaaaatcaatggttcttgcccggatgatgagctcctggcacagaagctgct 
tctcaaaggttgtgagcctctccctcgccgcagatgccgtcctgctgctcaacaggagtatg 
ttgagccttatcctcttcctgagagcttgtgggctactccgtcagattcatctgtagtttgg 
acagcatatacatgcaaaagctgtgaatgtctaatcaacaggaaaaaaaatcagaaagcatt 
tgatgattgcaaagactgctttgatctcaatgggagagagaaaactcgttggtcgtcgaaaa 
agggagctggccttgacttctctattgatgaagtactagcagtgaagaaggctggtacaatc 
agaatagggcttgacattggtggaggtgtggctacatttgctgtaagaatgagagaaagaaa 
cataacaatattaacaacttcaatgaatcttaatggtcctttcaatacatttatagcatcaa 
gaggagtcgta'cctttgtacataagcatttcgcaacgacttcctttctttgacaacacactg 
gatatagttcactcaatgcatgtcttgagtaattggataccaacaacactactgcacttctt 
attattcgacatctatagggtgcttcgacctggtggactgttctggcttgaccatttcttct 
gtgttggtgagcaatttgagcaagtctatgctccccttattgacagcattgggtttaataag 
gttaagtgggtcattgggcgaaagatggacagaggccccgagctgaatgaaatgtatctttc 
agcactgttggagaagccactgaagaactcttggtgatagtattagatgtttcttttacttt 
cttacttttgatagttacagaagagaagacaggaggtgtttttattttttattttttatttt 
tttttaaattatagattcatttcagatggctcttcaggcataaactaacagtttgaggtagc 

gtgagtatgatcttgtaatgttggtgcggcta 
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SEQIDNO102 

ccacgcgtccgcaaaatccacaacaatctcaaattggattttctaatctgtaatattaatta 
ct teat tcaaattatgtaaattcttttgtataaaaacccttaaaaacacaatctttt cat ca 
attctcaattgggtttctcttcttaatctgtaagtttttgttacttcattcaattttgtata 
atggattctgatttttctcccgggtgtgggtcgggtatacaatcagactttgcgttcgcttt 
caatgatagcaatttctcggatcggatcttaaggattgaaattgtacccgacttgccggatt 
gtaaaacgggctgtgaaggttgtactggcggcattgatgattgggcccggaaccgcaagcgt 
aggagagaagacatcaagaaagaaaatgatgcggacgtggtcatgcaacgtgaggagcaagt 
agtaaattgtaatgtgcttgaaatggaagatggtcttgctgatgatgaacaagatgaagaag 
ctgtaggaatgcttgaggaatcaccctctggcattgagatgaccacaaatccccctggcgat 
gatgaagcttctaaaagcgatgatgattcatctacaaacatggactcttcaaccccccttcg 
ggtgagaactatacatatcagttctcccattttggcagctaagagtccattcttttataagt 
tgttctcaaatggcatgaaagagtcggaacaacggcatgtaaccatacgaatcaatgcgtcg 
gaagaagctgccctcatggacctcttgaattttatgtatagcaatactttatcaactacaac 
actcactgccgtgcttgatgtgttgatggctgctgacaaatttgaggttgcgtcatgcatga 
gatactgcagccacgtactgcggaatcttcgcatgacttgtgaatcagcattgctttatttg 
gatcttccttccagtgtactaatggctgatgcagttctgccgttaacagatgctgcaa.aaca 
gtttcttgctgcacgtttcaaggatataaccaagttccaagaagaggtattgaatttgcctc 
ttgcgggaattgaggctgttctgtccagtgacgatcttcagattgcttcagaggatgctgtc 
tatgactttgcgttaaagtgggctcgcatccattacccaaagcttgaggaacggcgggaagt 
attgagctcacgtctttgtcgactcattcgatttccatgcatgacatgcaggaagctgaaga 
aagtcctaacatgcaatgattttgatcctgagcttgctacaaagcttgtcctcgaggctctt 
ttttataaggccgaagcaccatatcggcaacgctccattgctgcggatgcagggaatgcttt 
gtgccatcgttacatggagagggcatacaaatacagacctgttaaagttctcgagttcgaag 
cacctcgtcaacagtgtgttatttacctagatttgaaaaaagaagagtgtgctagcctcttt 
cctgctggtagagtttattcacaggctttccatttgggtggacagggatttttcctgtcagc 
tcattgcaacatggatcaacaaagtgcattccattgctttgggctgtttctgggcatgcaag 
agaagggggcagtgtcatttgcagtcgactacgagtttgcagttcgtaccaagccaaacgag 
caatacatgagcaaatacaaagggaactacactttcactggtggcaaggttgttggctacag 
gaacctgttgggtgtaggttggagcgcgtttttggctgacgatagtgcttacttcatcaatg 
gacttctccatcttcgagctgagcttactatcagccaatagagagtttaaatactatcgctg 
tgcttctgctgacagctaaactatactttttacttcagtgaggccttaagaagtttacat.tg 
tagtggcatcttacttgaaagtgcagcacatgtgagcaatagttgtatgggctatatattgc 
ttgttacctattggcatatatgcactggtgtaaattagtaaaatcagtctttgagcggttca 
tattttgacaatcacagtctttgttaagagttctagctgccc 
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ccacgcgtccgcccacgcgtccgcgctgagcgtttcaacgacttagcttctgttgattctga 
tcaactgctttcgatcccgtggattcgtaaactcttggatgttttcctctgttgccaggaac 
aattcaggtccattgtgtttaacaacactgcttacttgaataaagctcaatggaccgttaca 
ttactgattatttcgataggagtgtgaagggtttggatgtttgtaacgcgataagggacgga 
attgagcagatcaggcaatggcagaagcagatggagattgttttgtgtgcattggagaatca 
gaggagtgttggtgaaggccaatttcgtcgcgctaagaaggcgttgattgatttgactattg 
gtatgctagacgataaggattctaatgcaactgttaaccatagaaacaggtcattcgggcga 
aacaatactcagaatgatcataggtctatggggcattttagatcgttatcgtggagtgtatc 
gaggaattggtctgctgctaagcagctccaagcaattggtaataatttagttgctccgaaaa 
gtaatgaaattattgctactaatggattagctttggctgtttttacaatgagttatgtgttg 
tactttgtaatgtgggcactagtggctgcaattccttgccaagaccgcggcctgcaaacaca 
tttttatgtgactaggcaattcgtttgggccggcccaattttgtgtcttcatgaaaggattt 
tggaggaatcgaagaagagggatcgtagaaatgcttgtggattgttgaaggagattcaggag 
attgagaaatgcgtgcaccaaatgaacgaattgatcgatactgttcagttcccaatcacaga 
ggaaaaagatggagaagtaaaggaaagaattcatgaacttgggcttgtctatgatggtttaa 
agagtggattggatcctttggagcgccaggttagagaagtgtttcataggatcgttcggagc 
aggactgaaggccttgactctattggaagatgaaatcatgagtgacaaatttgtgtagaatt 
gggtgatggttgtcttttgagaaggcatctattattagaacatgagcataatatgatataga 
ttttccccttttttttcttttctttgtttgaccctttttttagatgagaagaggagggagaa 
tggttaatggtgtaatgtgcctaaagaataagtagtttaaagaggtgaaaatcatgtatttt 
cactttatatatgtaaagaaattaggaaaaaaggtggatctttggtccatttttggtgatgt 
tcatcttgtttggaattgtataatcacatttctgagttaagttgctttttaaaaaaaaaaaa 

aaag 

SEQIDNO104 

ccacgcgtccggttttcttctcatcccaaatcgcactctagggttacgccgcctctatcagg 
aaatcatgcctcgccgaagctctggaagatctgctcctcgtcctgcccctcgtgcggcccct 
cgtcctgctccagctccagtacaccatgctcctccaccagctcctatgcaaagtagcggtgg 
tggatccatgcttggtggtattggttctaccatagctcaagggatggcctttggtactggaa 
gtgctgtggcacacagggctgtagatgcggtcatgggtccacgcaccattcaacacgaaact 
gttgcttccgaggtacctgctgcagcagcagctcctacaaccatcggtgctgggtctgatgc 
ttgcagtatgcactctaaagcgttccaagactgcatcaatagctctggaagcgacattggca 
agtgtcaattctacatggatatgttgtccgagtgcaggaggaactcaatgctgaatgcttaa 
gcttgttgtgtctcattttaataactttgaactcattcttaatctgattgttgaaacagcga 
tggaattatgacaaaaggcttggtggtattgatggagcaagtgaatttggttcttgatacac 
ttttgggtcaaataatttatgctgaaatatgaactttatagacttctta 
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ccacgcgtccgcccacgcgtccgcccacgcgtccggccttggctctcactttcaaattcccg 
acctctttctagcgccgaaattaccagcacgcagagcaaggacaccattagccgttcggcca 
ctgacccaaatggaaatcggcggattagactccgatggccgtgaatttagaaacgcggacga 
gatgtggagagaagaagtaggagatggtgaccaccaaaagaagtctcaatggtataacaaag 
gcatcaat'tactgggaaggtgtggaagccacagtggatggtgtgctgggcggatatgggcat 
gtgaatgaggctgatataaaggcaagtgaggaatttctcaacaccattttgccagaaaggtt 
ccctgatgctggaagaggccgccatcttgtagctctggattgtggatctggcattggaaggg 
ttaccaagaatcttcttatacg;atatttcaatgaggtcgacctactagagcctgtatcacat 
tttttggaatcagcccgggtaaatttggctcctgaaaatttaatggtgtcagagttgcacaa 
agctgccaatttttattgtgttccactccaggaatttactcctgatgctgaaagatatgatg 
ttatttgggttcagtggtgtattgggcatcttgcagatgatgactttatttcattcttcaag 
agagcacaggctggcttgaaacctggtggactttttgttctaaaagagaacattgcaaaaac 
aggatttgtattggacaaagaagataagagcatcacaagatcagattcatattttaaggagc 
tgttcaatcaatgtggactatacatctacaagatgaaggatcaaaaaggatttccagatgaa 
ttatttgctgtgaagatgtatgcattgactactgagatgccaaggcaaggtaataaacctag 
acctaaacggacaactaatagacctgctatcatcagatgatgaatatcacattggtgttgtg 
tggttttactaactttggatgaagtaattcataggttattgtttttaggtcacatgtatgcg 
agttctgtcaatgttatgttattgcttttggatataagttatatacattgatagtgaagaga 
tttgttgtgtactttagcttattgtaggttacttcttatgttgaattatttatgcaaccgct 
tttgtatcaatgtattctgctcttcttgtaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 

aaaaccaatttaaaggtctgg 
SEQIDNO106 

tttgtacaaaaaagcaggctggtaccggtccggaattcccgggatatcgtcgacccacgcgt 
ccgcccacgcgtccgcaaggcttagggatgacgttgcccgataggaatgctaagaaaaaggg 
gaagcaaaaggcttccggtgagtcaaaagactcgcacgtagctgaagctcttgataagctta 
gggaacagactagagaggctgttaaaggattggaatcagtggccgggccgagacctggtgta 
gatagcttggggaatgatgcaatgatggaggagtgggttaagcagtttgaggagctttctgg 
atctcaggacatggagtcgatagtagagaccatgatgcaacagcttttgtcaaaggaaatcc 
ttcatgaacccatgaaagaaattgaagaaagatatcctaaatggttggaggacaacaaagct 
aagttgagcacggaagattatgaacgttacagacgccagtatgaacttataagagatctgaa 
caaagtttacgagactgaacctagcaacttcaacaaaattgtagagcttatgcagaaaatgc 
aagaatgtggccaaccgccaaatgatattgttcatgagcttgctccagactttgatatatca 
tctcttggacaactatccccagagatgttggagggccaacagaactgccgtgttatgtgaaa 
actgaaatgtccccccgcttgaatgtcctgcttgt-tttcgtcacctttgtcacagtttgcat 
acaacatttattttgct 
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ccacgcgtccgctatacctagatgacatttaccttagcccttaagccaaaaaagaaagaaaa 

gatccattgcctcatcctctgtaatctcatggattcactggtttcaatcctctgcatcttct 

tcttcttcaatataattcttaccccagttcatgctcaagtgatctttgaggatggttactca 

gttaagacactgattgatggccacaagatcaaaattaaccctcactccataatttctgtaat 

gggtgctggcaatttcatcattcttgattctgctgccagtactttttacaccttatctttca 

acaaaaactctgaattttctatttcgaagttaactggtagtgagactgctggctatgtggat 

ggttctctggataaggctaagttcaacaaacccaaaagctttgctgttgattcaaaagggaa 

tatttatgttgctgatatttggaacaagcatgcaattagaaagattagcaagtcaggtgtta 

ctacaatagcagggggttattcactaaagccaggccgtgctgatggacctggattaaatgcg 

tcattctcagctgattttgaactttcttttgttcctgagagatgcactttaatgatctctga 

ccgtggcactatgttagtgcggcaaatacagcttaaggccgaggattgttcaagagattctc 

attctgctctaagagcagtttctacatggttcttaaccgtggggcttccctgcttggtctgc 

ttgattctcgggttggtcatccgtccttatgttatccctaatgaacatggcagtcgtcttcg 

gcgcaacatgacatggaagcacttcctaatcagtctggagagacaagttctgatgttctgct 

tcggcatcagaagcgtagttgttgactcaaagatctattcapttttaaggaagctcgtgtta 

cttactttctcccatctgtgcctaatgtttagtcctaaagtagtagtatgccagacttctcg 

taaacaactggctcctctcttaagttttgacgactctgaaagcaaagaatcagcaaaatcac 

cggtggcagctaacattttggaggatttgataacttttgatggaagtttggttaactccgag 

ctgactactaatcaagatgatgcagtgagcaaaagtaccgatgtttctgttgtagatagcat 

gatactagctaatctaaaagggtttgcagaacaggggattgcttcttcagggcgtgaagttt 

catcgagcatttcgagcttagttaaccgaaaaaagaacgtaacttagtgaagtctagcagta 

gtatgtattactattaacttttgcaactgttctgaaagttcatcggtctatctgctaccact 

ttcatgtacatagtggaacaagcaaatgactcaaggcccttttgagttaatatttcctagcc 

tgtgttttcttggttccaaaaaaaaaaaaaa 

SEQIDNO108 

ccacgcgtccgcccacgcgtccgcccacgcgtccggaaaagaatcgcagtttcgaagctatc 

agaaaatcccaaacaacaaccatgtcttactacccaaaaggctaccacggggaagatgacga 

cggagctgaattcgacgagtacgatccaactccgtacggcggtggatacgacatcgctttga 

cttacggtcgtccgcttccaccctctgatgaaacctgttatcagacttcttcagcttctgat 

gaattcgactatgatcgtcctcagtactcttcttatgctgagccttctgcttatggtgagga 

ggctcttgagactgagtaccaaagctattctaggcccaaacctcggcccactccttcttatc 

atcgcccatctgaggaagaaggcgaagcttatgagcagcctcaggccgattatgggtttcag 

cctgggatgaatcgtcctggcagtggatatggtggggaaagtgaatacggatccgggtatgg 

acgcaagagcgagtatgaagaacccgcttccgaatacggatccgggtatgggagaaagaccg 

agtatgaagaacccaaaccggaatatgtagaacccgcttccgaatacggatctgggtatggg 

cgaaagagtgagtatgaagagcccaaaccagaatatggatccgggtatgggcgaaagagcga 

atatgaagagcccacatcagaatacggatctggatatgggagaaagagtgagtatgaagaac 

ccgctccagagtacggttcgggatacaggaggaagagtgaatatgaggagccaagatcggaa 

tacgggtcgggttatgagcgtaggaccgagtccgaagagtatggatctggtggatatggaag 

gaagcccagctacgggcaggaggaagagggggagaggaggcccagttatgggcgttcaagct 

accaga'ctgaggagggagaagggtacgagaggcctcgctatggaaggtctgaggaggaggac 

tacaggaagcctagctatgagaggcgtggtgatgacgacgacgagggctatggtcgcaagaa 

atatggtgatgacaactccgatgatgacgaggagaagaaacatcaccacaagcaccaccacc 

gcaaacactatgatgattgagcagtgtgctttaatcatctgaaccagatttatgccatacta 

agaactattacaaaataaaagttggcaagtttgagatacattttgtttgtgaatgtttgcta 

tgatggctggactgtccagttatttatgtgatgtattttgctcttctgcaaatcccagacat 



FIGURE 4 (continued) 



BNSDOC1D: <WO 030851 15A2J_> 



WO 03/085115 



66/140 



PCT/EP03/03703 



ttgtcagggttagtatgccatgaatgtgtgaactttatgatcatgatgactcttttatctct 
taaaaaaaaa 

SEQIDNO109 

ccacgcgtccgcccacgcgtccgcccacgcgtccgcccgagccagacgttaaacgacgtcac 

tttaatgtaccctttccccaaaaattggggctttgtaaattcatttgaacaattcacaaatt 

gtagaatttagggttctttttcagtaatggaaagtgggatttgtagtcccacaagatgaaaa 

gcagtcaagaagaagcgagccgcggcggcgcagcagatgaagaaagtgtgaaatttcacagt 

qttgtacagccacttagggatttagaatccaactggggtgttgatttagccaaaaatcttga 

agaatatttgctcaaaatttgctctggtgaaattactagagataattatgatgatggtcatg 

tqaattttgctqaagctgcattgctgcttcaggggtcagttcaggtgtacagcaggaaggtg 

gaatatctgtattctctggtattgcattgtttggaattcattaccaagaagagtgaaccaga 

tctaccagcaagtqtarcagcccaagaagatgaaaacggtttgcctgctgccgacaatgaag 

agaatgatccatac'tgogcttcagaagaaacctcagtggaagcaaagaacatgttggataat 

acgacgtgcagggartcttcatttacccagtttgtgaaggcccctgcaaatctggttgtacg 

cgaggctgactgcttgqatgttactggagatgctggagaactagagtcttacctgctagcca 

cgtgtgatctttaccgagattttattctgttggacgcatgtgatgccgtaacagtggatgag 

tttctgaataatgagaatatagctqgaaaggtgctgaacaatagctgcagtgcagagggcct 

ttctttggactccaaqtgccacaagagcttttactctcccacaagacgttttgagggaactg 

gcaataagtcttcagctcaaaagaatcaggatgctaatttatatcagtctcaagggtttcat 

gagtttggtccaggcaattttaacaatgatcagttcgcatctgatatgcctgattacatcga 

tgatgcacatagatgtgaagatggatattcagaacctagagactcagacgaatcggatgatg 

aagacccatggaatccgttgaacccgcatgaacctggcactttgaaagtaaaaccatacaaa 

aaagttaaatttaatagaaggcagggtgcggcgtccaaaaaagttgcatctttggctacaga 

atttccagttgcgagattacatggtaccactagcgcagacctcaacgacatgtgggagagaa 

aatgttgtgccatgaaaaaacaaggcgactcacaatctcctccaccatatgagaagctccgg 

gaatcacttcttcatggggagaacaacgattatgatggtttggatagtccaaaggaaaagaa 

tgaaaatgatgactatgatagtgcagatcacgattttgggccttctgcctttgacatgccag 

aaaatgctgacatgaacaccgatgcaactccttatggggaaaagcacgataaatgtagtcca 

ttttttgacagtgaagctcatgaagattcgaatgctcatgccaaccttgaagatctttgtcg 

ctcccacttggattctcttcttgctagccttgctgaaactgaaaagcagagtgaattggctg 

cacgggtttcaacgtggaaacagagaattgagcagaacttggaggaacaagaatcacatccc 

ccctttgacattcatgaatatggggctagggttttgtgcaagttatccctggaagaaaatgg 

tcaaagcaccaagtctttttctgatgttgtcacgggtcaagagaagcatgatattgctcgaa 

cattttctgcgcttctgcaattggtaaacaacggagatgttggtttggaaagaggtggaata 

cgtgagtccacttgttacacagctgcaaatcccttctatgtccggctccttaggaatgataa 

tggtagggagaaaatgcagattcggtcatcaagaaagagagcaaaatctccaatacccaatc 

agggctttagaaaggaaaaaaacaaaggtaaagaagttcaggctgctttcagttcatcacct 

tcagaacccaactcaaggttaccgatttgccctgaagctgggaaaggttaatggaactcgtt 

gtacgcctgaaggtaagaaaagaaggaaatccagattagtcgtaccaccagatatacatact 

qcattgtgatatacattttgctctagttttcaagtaagcctctcctctctcgctattcggtc 

tcactgtgcccgttgtatgtgagagactcaaggcagtaattctgtttgagtgtagtaagaca 

gaagattaaccccaccatgaccactgtaattcttatcacaaaccaacaacctgttggctgca 

gaaatttgtaagatgtgtttattcttaacttaattaggacttactaatagtttggagcaggg 

aggatgtaacaatattttgacatagtgcagagctactcatcatagctc 



FIGURE 4 (continued) 



BNSDOCID: <WO 030851 1SA2_I_> 



WO 03/085115 



67/140 



PCT/EP03/03703 



SEQIDNO110 

ccacgcgtccgcccacgcgtccgccaaaatccatcacgaattgcattttcagatacgtgagt 
caactgctaatgggagaacacttggctctatgtgttgatcgtcttatcacacctaaatcttt 
gcactcgttgcaagggtcagaggatgcaggatcctctgcaggaagttcttgctcgcacacag 
taggtcaatcaccttatggtactactaataaggaggatgaagaactagaagctggaggtgaa 
gatgagccattacttcagactgtggaatgccgaatttgccaggaagaagatagcactaagaa 
tttggagattccttgtggctgcagtggcagcttaaagtatgct cat aggaaatgtgtt cage 
gttggtgcaatgagaagggtgatataatttgcgagatttgtcatcagtcttatcaacctggc 
tatactgctccaccaccgccttctccttctgaagatattgccatcgacatcagtgggggctg 
gacagtggctggtactcagcttgacttgcatgatccgcggcttcttgcgatggcagctgcag 
agcgccatctcttggaggctgactatgacgagtatgctgattcaagtgctagtggagctgca 
ttttgtcgttctgctgctttaattttaatggcccttctattattgaggcatgctgtgaccat 
cggaaatgatgatggagatgacggtgatgtctccacctttttctctcttttcttgctccgtg 
ctgctggttttcttctaccttgctacatcatggcttgggctatcagtatcatgcagcgtcga 
aggcaaagacaggaggcaacagcacttgcggcaacagaagttgctt teat get gcaggcagg 
gcaacataggggcttgcatgtaacaatagcaccaggacctgcacagttagctgaaccttcag 
caacaccagcacacccaactactcatgttgcaacaccaaccgcccaggcgacatcccctcct 
ccagagatggtataaatggctttgctcagtttgcttgttataaaatagttgccgataggggc 
attttactgttggtaagttgcacaagatggggatgagtagaagggtagaggagtattctttt 
ccctttttgctttttcgatttattagctgtatctttgcattgccaaatttggagtgcagagg 
ctgaaactttttccatttgttcaatttttcattaatgcttgaacatgtaaaaatataatagc 
gaacttagctgctttcaatgtggagataccatatcttcacatcgtgtacat Lgtttatatat 
taccattatgggttactcttaaaaaaaaaaaaaa 

SEQIDNOlll 

ccacgcgtccgatgagacttggagggtgtcctctggtttagctgaggcatggcgagacaaca 
caaatgttgcatccaagaaaaagtcattttccattgaaactgaaattgatgatgaggcgact 
agttatgcgtctttgaacgaggacggtcatgactttgatgagattgaggatatgaggatacg 
cgggaacttgttttacaagcttgataaagattccaaggaatacgaagaatataagtttgaat 
tccatagaaggaatacgaacaagaataatggaaatgacggtccaaaagagaaggaaaaatcg 
aataacgtttcagcttctagggtcgagaaaggtctaaagggtatagatgagaagcagcaaaa 
caagaaagagaaactgagctataactctgcctctccgtttcagaattttcagctaaatgatt 
tcggagcatctccaataaagaggttaagggttccaacttttaatcagcttactgccccttat 
catgagccgttttgtttggatatttatgtgtcgaaaggttcagtaagtgctagcattatcca 
cagagctactagcaaggttgttgttgtggcgcactctatttcaaaggacatgaaatttgact 
tgggatcaactaagaatagagctacttgtgctgctattggggaagttctggctcaaagagca 
ctggctgatgatattcataacgtagtttatacgccaaggaaaggggagaaattggaagggaa 
acttgagattgtacttcagtccattattaacaatggcatcaatgtgaaggtgaagattaagc 
agaggaaaaccaagaaacctggcttccaccgcccgacagcttaggtggtcatcctacattac 
gtaggatgaaattaaaagtgacaaggaagttttatcaacgtcttataagctcgaaacagcgc 
aatgtagtaagtagaacaaggtcagagatgtattactacctcttttgcgaggttgcagaaca 
tttccctaaattcagtctttaaatcggtttcaatagtagttacaaacttgggaataaatctt 
ttatttcctgcaatttgtattctctttatgagaatacattgctgttaatgtaaaagtgtgac 
tegcag 
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SEQIDNOH2 

ccacgcgtccgcccacgcgtccggctggaactttgctgtatcatcttcaaactcttgattag 
ttatattaaagactagtctttaaactcaatgggtgatagtcagtactctttctcactcacca 
ctttcagcccatctggaaagctggttcagattgaacatgcattgactgctgttggatctggt 
caaacttcattagggattaaagctgctaatggtgttgtaattgctactgagaagaagttacc 
atccatcttagttgatgaagcatctgtgcagaaaatacaggttttgacgcctaatattggag 
ttgtctacagtgggatgggccctgattctcgagttttggttcggaaaagtagaaagcaggct 
gagcaatatcaccgactctataaagaaccaatccctgtcacacagctggtgagggaaactgc 
tgctgtcatgcaggaattcacccaatcaggtggtgtaaggccatttggtgtttcactcttgg 
ttgcgggatttgatgacaagggtccccaactatatcaggtggatccatctggttcatacttc 
tcttggaaggcttcagctatggggaagaatgtctctaatgcgaagacatttctcgagaagag 
gtacacggaggatatagagctcgatgatgctgtacacactgctatactgactctaaaggagg 
gattcgagggacagatctctggcaaaaacattgagattggcattattggaactgacaaagta 
tttaaagttctcacgccagcggaaatagatgattacctacaagaagtggaatagattttctt 
ttccgcttaaggcattggaaaaagttgtcaggttggaagcgcagacggggtcatagcacaac 
tattggatgttcttgttggcttgattatcacttgactttaatcaaactagacttagttgtat 
gttggccatgttgtggttatttattgcctgatgtatggctctgaaaagttatatgggttttc 
ttttctcagtttcttgaacatactgattgttctatgttacctgaaacacatgacagtagaga 
aaagcattatattatttgagcaaccctcttacgtctgagaacgg 

SEQIDNOH3 

cgtccgaaatggcggaagacaagaaagagtcaacgtcgagttcgccgctccaagaagatccc 
gaagatcccgtcaaatcccctccttcttcccccaattcctccactcgcaaggcttgctatgc 
tgttcttcaaagttgggtgtcaaagaagttcatgactggatgcgtggtcctcttccccgtgg 
ctgttacatttttcatcacttggtggtttattcaatttgttgatggtttcttcagccccata 
tatgaaagacttggtattgacatatttggccttggatttgtgacatcgataaccttcatatt 
ctttgtcggtatttttgcttcatcatggctgggttcaacagttttttggataggggaatggt 
ttataaagagaatgccctttgttaagcatatatactctgcatccaagcaaattagttctgct 
atttcaccagaccagaatactaatgcattcaaggaagttgctataattcgtcatccccgaat 
tggtgaatatgcgattggtttcataacatcttcagttgttctccagagagatgatggggatg 
aagagttgtgcagcatttttgtccctacaaatcatttgtatataggagatgtatttctggtt 
aattcaaatgatatcatcaggccaaatttgtctgtgcgagaaggcatagagatcattgtttc 
tgtgggaatgtcaatgccgcaggtgatttctcctatagaaaggatcacacgacagaccgacc 
ggatccctctaaacagaatgttaaagtaaacagaatcatcatctcatttgcttctggtttgc 
gctaagctaccataatctcatttttttagggaagtcgcatatgtatactgttggtcttctat 
gttcatttgatggttcagcagatctgaactggagcaattagcaacattggtgactttgttgt 
gtgtttattctttaggattagtaggaggagttctgtttgtcggaaacaaataggtagggagg 
cattgtttggctgtagctggtttactctaattaacatttcaccgtggtctgtacagtcttgt 
aacttatgagttcttgtgtttgtattataaagaggctatcagtgttatcgc 
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SEQIDN0114 

ccacgcgtccgcccacgcgtccgcccacgcgtccgcccacgcgtccggcagcctacagtcca 

tattcacgtgctacatcccctgccccaactttggggcatgatggccagctctatggatcaca 

acaataccactatccgtatttccagcccctccctccaaccagtaattcgtacactactccag 

ttgccctgccaaaaggtgagattgccacctctgctgctgctgctgaccatgcatcgttgtct 

gttgattctgctaatggaatttctaatggcattgccaatggtggtgtaaagggaaatgctgg 

gcctacgcttgtgaggcctgcattccagaacccatccgtaaatgctaatggttcttatggac 

ggggtgcgttgcctggaggagctgcttcaggttatcacgaccctagattaggttttgatggt 

gtgcgatctcccattccatggatagatggatcaatgttcactgacgggcaaggtaggctagt 

gtcgagcaattcttttacaccatctttttcaaatggcagtgccgttccatcatcaaaaaatc 

agaatgttcatccgcatctaatgggcttccaccacccaaggccctcttctggcatgaacaca 

acaaatgggtatatgaatagqatgtaccccaataaactgtatggggggcggtattgtaacac 

attcggtactggcatcggctttggatccaatggatatgatacccgtaccacaggtcgtgggt 

ggatgacggttgacaacaagttcaaacccaggggtagaggaaatagtttctacggtaatgag 

aacatggatggtttaaatgagctcaacaggggacctagaggtaaaggtttcaagaatcaaaa 

gggttttacaccagtaacgctggcagtcaaggggcagaacgttccgctcaccctaaccaatg 

atgctgagaaagaaaaaccaagcctgattcctgacagagaacaatacaactgtccagatttt 

ccagtgacatatactgatgccaagttttttataatcaagtcttacagtgaggatgatgtgca 

caaaagcatcaaatataatgtttgggctagcacaccaaatggtaacaagaagcttgattctg 

cttaccaggaggctaaacaaaagtctggtggttgccctgtttttcttttcttctcggtgaat 

acaagtggtcagtttgtcggtgttgcagagatggtaggaccagttgatttcaacaagagttt 

ggagtattggcagcaagacaagtggatcggctgctttcctgtaaagtggcacatcgtgaagg 

atgtaccaaacagcttgttgaaaaacatcacgctggagaacaacgaaaacaagcctgttacc 

aacagtagagatactcaggaggtcaaaatagagcagggcctacaggtgattaagatatttaa 

ggatcatattagcaaacagtgcatccttgatgattttgagttctatgaggatcgtcagaaga 

gaattcaggaaaagaaggtcaagcagcagctattccagaagcagtcgcaggtatgggaaggc 

aaagctactgaagagaagaagaaagaaaacacgaaagtggaacctaagtcccagaaaccttc 

agaagttcctgctggtttgaacaaggaaagtttacccgctgctccgactaatggggaggtga 

agcttacagaaaatggatcagttacaaagggagatgatatgaagggtgctaaaccagtcact 

gtagcggaaaagaaacctgtagctatagggatagcaaaaggagttgctaatggatgctagct 

tcacctaatgaagggggaggtctgtggttaaagaagccctaaattggagcttgttgactaca 

tgatatgcacgccagtgcttggttagatctcataaccattggactgccccttttatcctagc 

tgcatttggagttggttcttgcattaagaaatccccggagataaatcaatagtggcaaggct 

agttcaatctgtttctaagagttcaggaagtatggaagctccattttccctcaggttttagc 

ttctgacaggtttcataccttggtttgggttttaggataatttttttttataattttgtttt 

cgtcatgtggcttattttggtcaattttcccctttttttaaaagttatttgggttttaaagg 

gtggggttcttgttattattagtttggctcccaatcctatcttgtaaatctagatcaatctg 

ttgcggcagttcccaacattgcttttttgtactaatgattgagctagaagctagttttaaat 

gtcaagtctctaccgg 
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SEQIDN0115 . . 

ccacgcgtccgcccacgcgtccggcgggatttgtgagtatttgaatgaggaaaaagggtgtc 

cagtgtgatgagatatgagggttaagattgtaggaaaatgggttgtgtatttgggaaagaga 

tttcatcttctgagacacctaatggggaggttgtagttggaagtaggagagaaaatggggta 

gatagagatttggctgccccatctgggaggagagagaaagttggtactgtaaataaagtgga 

tgccgtaggtggcggcggaggaggtagtgatgttggtgaagttgtgaatggtagggatcaga 

aqgatgagaagaagggtgagaatgcaaggcataggggtgagaggagaaggtctaagcctaat 

ccaaggctaagcaacccacctaagaatgtgcacggcgagcaagtggcggctggatggccgtc 

atggctttctgatgtagctggagaggctatcaatggttggattccgcgcagggcggatgcat 

ttgagaagctagctaagattgggcaaggtacttatagcaatgtctataaagctagagataat 

ctaacqgggaacatcgttgcactgaagaaggttagatttgataatttggagccagagagtgt 

qagatttatggcaagagagatcttgattttgcgccgcttggatcatccaaatgttattacgt 

tacaaggattggttacgtcaaggatgtcttgtagtttgtacctcgtgtttgattatatggat 

catgatttagctggacttgctgcaagccctggaatcaagttcacagaggctcaggttaaatg 

ttacatgcatcaactattagcagggctcgaacactgtcataaccgtcttgtgttgcatcgcg 

atataaaaggatcaaatcttcttattgacagtgggggagcactcaagattgctgattttggg 

ttggcttctttctttgatcccaataaaaagcagcccatgactagtcgtgtggttactctatg 

gtacagaccaccagagcttctacttggagccaccgactatggtgttggtgttgatctttgga 

gtgctggttgtattttagctgagctattagctgggaaacccattatgcctggtcgtacagag 

gttgaacagctccacaagatcttcaagctatgcggatctccgtcagaagaatattggataaa 

gtcaaggcttccacatgcaactatattcaagcctcagcaatcatacagaagatgtatagcag 

aaacttttaaagattttccgccttcatcgttgccattgattgagactcttctggccattgat 

cctgctgagcgtcagacagctacaactgcattacagagtgcattctttactaccaaacctta 

cgcctgtgaaccttccagccttcccaaatatccacccagcaaagaaatggatgcaaaacggc 

gagacgaagaatctcgaagacaaagagctactgggaaagctaatgctgatggtgtaaggaga 

aatcgtcaccgtgatcgagcagtgagggcaatccctgctcctgaagccaatgcggagctgca 

agtcaatatcgataggcggcgtctagtaacacaagcaaacgcgaagagcaagagtgaaaagt 

ttcctcccccacaccaggatggaacattaggttataccttgggttcttcacatcacattgat 

ccagcctatgaaccctcagaagttccattcacatccattaatttctcatattcaaaagaacc 

gatccaaacgtgqtccggcccattggtggaacctgcaactggtgctccaagaagaaagacaa 

agccatcaaagaaggattctaacaagaaaggaaaagaaagcctgtaaagtctataatgaacg 

acgtgattctacaatggtatacttcaaggagctgcaaaacttacagattatttgtcctatac 

gtaaatcaagaagcttctcaacagcatagagaggtaaacaagcatttttatcgtagtattct 

cctttgtattcttttggataatgagaatcttttcattattgtacatgtaaattttgtttctt 

catattagcaggctctgtttagatcaataaaatcgtaacgctc 
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SEQIDN0116 

ccacgcgtccgccgttttccaacttccaatgcgcggcaaaccctaatcctcagctttggttt 
ttgcctcagaaaattcatccgtcaatttgacctctattatggggcgcagtgactctagatca 
cctgccaggggtcgtggatctcctcgtaagaggagcccttcacgcagggaaaggtcacctgc 
tcggaaaaagagttcacatgctgcaagttcagctgtagcagagaagccttcaaaccgtaata 
ggtccccgagacgtgcaaggtcaagatctcttgttcctctttcacctgcaacagagaggcca 
tctagtcgcaataggtccccaaagcgcagaaaatcaatctcccctgcatctcactccccagt 
cagagagaaaccctcgagtcgcacgaagtctcccaaacgagctaagtcaaggtctcctgatt 
cgaggttgttacaggtagagaagtcttcaggccgagtcaggtctcctagacgtgccaagttg 
cagtctcctgaatctcgctcaccctcaccacgaacaaaaagactaaggagagcagaacaaga 
gactgaagaaaagacaagggggcgcgagcctgagaaaaaccatgggagagctagtggtaggg 
ctgctctacatagggagaaggattctgatagaacagtgcctgaatcccgttcaccgt caeca 
cgaacaaaaagactaaggagagcagaacgagagactgaagaaaactcgagggagcgagagcc 
tgagaaaaatcatgggagagctagtgatagggctacacatagggaaaaggattatgacagaa 
cggtgcttgagtcccgttcaccgtcaccacgaactaaaagactaaggagagcagaaccagag 
actgaagaaaagttgaagatacgggagcccgagagaaatcatggaagagctagtgatagggc 
tacacataaggaaaaagattctgacagaatggtgcaaaatgaaaggagagagaaaagatcag 
gaaaggatgcactggataatggatcttctaagtcaagaaatggtcgatcagcttcaccttca 
gaacgtcagcataggagtcggcacagatcgagatcacctgcagcagcggacacgagagcacg 
cgatgagatgacaagctcaaggagaggtgaactcaggaatggtgatgatgactccttatcta 
aaatgcaggcggcagaggaggccttgcaagctaaaaataaagacaagccttcgtttgagctc 
tctggaaagcttgcagcagaaactaatcgagtaagaggtataacacttctctttaatgagcc 
accagatgctagaaaacccgacgtacgatggcgcttgtatgtttttaagggtggtgaagtcc 
ttaatgagcctctatatgttcatcgccaaagttgttatctttttgggagagaaaggagggtt 
gcagacattcctacggatcacccatcttgcagcaagcaacatgctgtcctccagtacaggca 
agttgagaaagacaatcccgatggtacttcatcgaagcaagtaaggccgtacgtaatggatc 
ttggaagcactaatggtactttcattaatgaaaatcggattgagccccagagatactatgag 
ctattagaaaaggatacacttaagtttggtaatagtagccgagagtatgtgctgcttcacga 
gaattcagcatgatgagtctctaaaatggttgacggaggtgtcatttgcattgattggcttt 
gacgtcagaagctttatcagatcaaatatttgctgtgccatgttactagcaggatagccgtt 
gtaagtgcttagccgaaatcgtgtaatgtggtagagatttgggcattgcttgcaaagttttt 
cactgctaatgaaaattttggtttatgcatcagtgatttatcctccagtttgtttataagct 
ctttgtcccctatatatgggatatgttattgttgattaggtcttaacttgtgaatgtgcgct 
cttttcttctaattattgaagatgctggagtgccccc 
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SEQIDN0117 

attagatgcgtttggctttgactcaagcccttgaatcaaaactgcaatcaccgttgtggcag 

gttcgtgtgaaagcaatctgtgtcctcgaggctatcttgaggaaaaaagatgacgagcactt 

tggtattatggcatcttatttcaatgaaaataaagatgttgtggtgaaatgctttgaatctc 

cccaagcgtcattaagagaaaaggcaaacaaagtcttaagtcttctgaatgatggacaaaca 

gctgattctgtgcctcatgtagataggtcagcaatggctggtgcccctgttgttcagatgcc 

cgacttgatagacacaggtaattccgatgatctgtttggagcagatgatttagcaaatatgc 

agagtggtgaagggataaaaattgcatccacctctggcgcccctctggttgatgatctattt 

ggagacaatttgggtggcggcgtggcttccggccagcagaaaaatggtgatgacccctttgc 

tgatgtctcatttcacaccagtaatgagaaggcgcttgaagctgatcacttttctggaatga 

catttgataaaacagatgctactgaagtccatttggctgtcgatagaactggacctgaactg 

tttgacatgtttggtcccagtgttgaagttccccaggatcccaataatcctagaaaggagat 

tcacgatttaatgaatagtctctctttgaatgggaatgactcatctaagaagcagaatggca 

gctcaaggggaacctatccggatatgtttcaagagtccactattgatcctcatcaggcttcg 

aatgatgccttgaacagcatattttcctcccaggctggtggagcaaattcaaatcccatgtt 

tcctttgggtgctatgcagtataacttgcctcctggcttcgtattgaatccatcatttgctc 

ctcaggctctaaactataatgccatgggtaacatgtttgctcaacagcagttctttgcaaca 

ctttccagttaccagcaattagctaccatgcacccatccactagtgctagtcatgccgctga 

ttctgctggaggttatggttcagctcttcccgatatcttcaatcctagcatttctaatcata 

gtcctacttccttgatgaatacttcaaagaaagaagatacaaaagcatttgattt.tatctcg 

gtaagtttggtccttgtgtacttcaatttttctattattactttgaagatgcattgaattgt 

gacggccctagtgtgtccctgattttgaaggtatgccattaacaattatgccttttgtttat 

ctattttatttggcctaaatccctccttcctccactccaaaaagatgaggtcctccgccttt 

attcttgtggtgataaatgaggtccattgcattgtccttttccaggcccccagtcatgtaaa 

gaataagtggacttggaaatactctggcatcataatcagctattccttttctgttaatgtac 

ttagatatctcattgtagggcttctcagtgcttcatctttttttgtcaatgttgtgagcaat 

aaagtttctcagttctgattgtgtgcaatatcatcttttccaaactgagaagactagaaaac 

ttcatttaggactgtatgactctaattttgttgccatggtggattccctgtgttttttgcag 

gatcatttggctgcagctcgtgatccaaagagggtgatttgagtggttatagctcaaagcaa 

cccagagtatgctgcttataagatttagctatgcacaatttgaagcaggagtaatctgtaaa 

ggttcttttgaagcagtgatatcaatgtgaaatacagtattatttttttttt 
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SEQIDN0118 

ccacgcgtccgaacgattctccctcgtaacttcattttcagtcatggcttctgctactaaga 
aggtattggttccgattgcgaacggaaccgagccgattgaggctatagtgcccatagatatt 
ttgcggagagctggtgcggaggttactgttgcctcagttgagaagcagcttcagattgaagg 
agtgcacggaattaagatcgttgctgatgctctaatttctgattgtgcggatactgaattcg 
accttatctcacttccgggagggattcctggtgcaaccaacctaaggaattgcaagactttg 
gaaagcatagtaaaaaagcaagctgaaaatggacggttttatgctgcaatatgcgctgctcc 
tgctgtagcacttggatcatgggggcttctgaaggggctgaaagcaacatgttatccgtcgt 
atatggaggaactatcatcttataccattgctgttgagtcaagagtccaaaaggatggaaca 
gttgtgacaagtcgaggaccaggaactgcgatcgagtatgctgttgcattggttgaggagtt 
gtatggaaaagagaaggctgatgaagtttctggcccactcgtgatgcgcccaaatcacatcg 
aagaattcgcatttgctgagctcaattcagtaaattggacatttacgagtaagccacagatt 
cttgtacctattgcgaatggttctgaggaaattgaagcagctactattatcgatgtacttcg 
acgagcaaatgctcaagtagtagtggcatctttggaagatacattggagattgtcgcttcca 
gaaaagttaagctagtagcagatgtgctccttgatgaagctgctaagcagtcatatgatctt 
atcgtcctgccgggtggtcttggcggtgcccaagcatttgccaactcagaaaagttggttga 
catgctgaagaagcagagagaatcaagcaaaccgtatggagcaatgtgtgcatctccggctc 
tagtcctagagcaccatgggttgctcaagggtaaaaaggctactgcctttccagctatgtgc 
aataagctctcagatccaagcgaagcagaaaatagggtgttggttgatggcaatcttgttac 
tagcagaggaccaggaactaccatggagtttgcactggccattgcagataagtttattggcc 
gcaaggaagtactagtgctagcaaagaagatggttttctaagtagaatcattttgcctgtct 
tccgtctttaggattatataggcacccatagttacccatagttgtaaacttgtaataaactt 
tggccatcagtgtgcacttaaaataaaagaaatgctgtatacgagttacactcagtcgcatt 
tgctaatttcctattcaatgccatcgctttttaaaaaaaaaaaaaaa 

Group 3 

SEQIDN0119 

GNATGNCTGACCNAGNTGNGTGCTTAAGTNTCGCANGCNNCTGTAGTGNAGGGGACCNNNCA 
NTNTCTNCTNGACGNCCGCAGTAACCAGNNCTCTNAACCNATGCATNATGCAGATNCAGGCT 
TTNCAGTCTNTTANGGCTCAATTGGTGTATGCAAGNTCCAGGACATGGTGTACGATCAGATN 
ATGATCTGCAAGCGAAGAATTNTGNTTTNTCAAATGCTCTTTCAAGCCCTGTTCGACGAAGC 
CTGCAGAACTATCAAATTGCTCANGGAGGTTTCCTT 

SEQIDNO120 

ACTCGCGGACGGAGAACAACCGAGAGAANGGGAGACATNGNTCCAAANTCGTGGACTNCCAA 
TGTACTGNTGAGCACTNNGTAACTNATNATNTGGNTNATGAGGGCNNGGCANGAATAGGCAG 
NACGCGGGNGCAAACCCTGCGAATTGATACGATCAGATAAAAGATCATCANNATGGANAGGN 

GCNGNTGTTCTGGGGT 
SEQIDN0121 

TNTGGCTAAGAACTGNNCTATANCTAGNGACANTGTGCTATTCGACTCACAGGAAAAACTAC 
TAAAGGATTCTCTAGATTNGCATAATTTCCAAGAGATAAAAAAGTTTTTCCATGCTCACTTC 
AAGGTGGATCGAGAACTTCAGGCATCTGTTGCTGTGTATGCATTGAAAGGCCAGNGATTCTG 

TTCTTGATGAGCAAAATAGAATCTACAAGTTGGTGT 
SEQIDN0122 
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GGNGAGGNGACGGNGANTGGAGCNGTAGTGTCGCGGGAGGAGGGACAAAAGCTGNANGNAAG 
AACAGCGNNACAAGANTNACACCTNCTGNAATATANT 

SEQIDN0123 

AATAGCAGCGGCAGCATACGACTACGAGAACGATCCGAGATGGGCAGATTACTGGTCCAACA 
TNCTNATTCCTCCTCACATGGCTTGCCGTTCCGACGTNGNNNACCACT 

SEQIDN0124 

TTNAGNATCCGNAAGTTGAGCAACAACTTCAGCACTCGNGCAGAGAATGGAAANTCGATGAC 
ATTGTGNACGCTNGNGGCAGTGGGGGTACGGACGCTGGATTGTCCATTGCATCCAGGCTCAG 
NGGC 

SEQIDN0125 

ACTCCTTACAACCAGCAGATTGCAAATTGCTGCAAGGGAGGAGTGATCAACTCATGGGGNCA 
AGNATACTGCAACATGCTGTTAGCTCATTCCAAGTCAGTGTNGGTGCTGCCGG7^ACAACCAA 
T AAAAC AG T TAG AG T T C C T AAG AA C T T C A C C 

SEQIDN0126 

GCCAGTCATTCGTTGTCCCATAAGCCCAACAGCCACCCAAAACCATCCAAATCTGAGCTGTT 
NTCCAGCGCTAAGCTAGTGGCCGACGCGGCCAAAGCAAAACTCCATCACGAGCCAAACAGTA 
AGGTCGACAAGTCTGAGCTCGCCGGAGCCGCCGCTGACCTCC 

SEQIDN0127 

AGAGNCAAATCCCACATGGGGCATCTGGCTTGGGCGGATGCTTTTGTCATCACAGCAGATTC 
ATGTNTAGACATGTTGAGTGAGGCTTGCAGTACTGGGAAGCCTGTGTATGTAGNTGGAGCTG 
AACGCTGTACGTGGAAGCTCACAGATTTCCACAAGACACTCAGAGAGAGGGGACTGACTAGG 
CCATTCACAGGACTTGAGGATATGTCAGAAAGTTGGAGTTACCCTCCGC 

SEQIDN0128 

CAACAAGAGGAAANTGGAGTTCAATTGACTTGGGAACTGAAATTTATAGAGACGCTAATCAA 
GTGGCTGAGTGGACTGTCTCTGATTTTGACATTCTTGTACCCAATAACAATTAGAAGTA 

SEQIDN0129 

CTTTTGCTAGAATCTTGCAAGCTGGACAAGGTGAAATGGTCGGGAGTGAAAACACAGCATTC 
AACAACCCCGTTCGTTGATGAAATGTACGAGCGCCTGAAAGAAACTCTAACTGACTATGAGG 
TCATCATCTGCCGTTGGCCGGAGTACACATTTGCATTGGAGAATGCCATTGCTGATATTGAG 
AAAGCAATTTTGGATGCACTAGAGAAGCAATATGCAGATGTCTTGTCACCA 

SEQIDNO130 

CAGAAAAGGAGGAT^AAAATGAGAAAATATCTTCTGCTTAGAGTGTTGTCAAAGCTTTTGCCC 
TCACTGCCTTCCTTCTCATCATTTTTGTCCTCTTCTCTTGGTCTCTCTCTGTATAATTATGT 
AGTAGATAAAACTTCAAGTATTCATTTGAGGTTTTTGTTTCCTAA 

SEQIDN0131 

CTCTNCGTCACACGAANNAGTACTTGACAAGGGAGTTAGTACTTTATANNGACGACANTTTA 
GNCN 

SEQIDN0132 

NTNCCATGTTNANAAAATNCAAGCTCTGAATGGAAACGGCTTGGGTGCTGATACTTCTTCCT 
TTGGTTTCTTGGGACAGATTCCTCGAAACTTCAGNTTGTCGGAC 
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SEQIDN0133 

GTAGNATTCTCCATTTGAAAATACATAGTGTCAGGCCATCTGGGACATCAAACGGAGGTGGA 
GAGACTAGTCTAAGGGCAAGGAGACCACCAAGCCAAGATCAGGATGCTGCATTAGCATTGCG 
ATTGCAGTATAGGAATTCTCTTGCTCTGGCCAGATCGAATTTGAGGGCCATGGCATCAAGAG 
CCA 

SEQIDN0134 

NNNNGNAT.NTNTCTGNTATAATCTTGCAAGNTGNACAAGGTGAAATGGTCGGGAGTGAAAAC 
ACAGCATTCANCAACCCCGTTCGTTGATGAAATGNACGAGCGCCTGAAAGAAACTCTAACTG 
ACTATGAGGTCATCATCTGCCGNTGGCCGGAGTACACATTTGCATTGGAGAATGCCATTGCT 
GATATTGAGAAAGCAATTTTGGATGCACTAGAGAAGCAATATGCAGATGTCTTGTCACCA 

SEQIDN0135 

TCTGGGTATCATTTGGGCTGTTCGGTACAAGACGGATACGGAGAGTCACTTANGAGAAACTG 
TTNGGAAGAGAGAGAAAGAGGCGGGAAACTATTGGCNAGGTGTGTGGAGGAATTGAAGAGAA 
AAGGGGTGGAGTTTGATTTGTTGAAAGAGGTTGNCGCTCTTAGGAGGGCTANNAGTTNGAGG 
GTTGAAACTAAGGTTG 

SEQIDN0136 

GCACGTTTGGCTCGTCTGCTCTCCCGCAAGAAAGTGCGAGTGATATGGATATCAGTAGTTCA 
GTACTTTTGTTAGGGTCAGCTTGTTGGGGAATCCGGTTCTTTTGTTATTAGGTGGTAAAAGA 
AACTTTTATGTCGCTG 

SEQIDN0137 

TTANGNGCCAATGTTTCAATACACATTGCCCCCGCCATGAATATCGGAACAATGACACAATT 
ATTTGATGTAGCACAAGAAGAGTGTTCAGNCCTTTTCTTGTGGACTTATTTGGTTGCAGCAT 
TTGCACTTACTATTTGGTCCACTGTATTCATGTGGCTCTTGTCCTGATTGTCACAAGAAGAC 
ACAATATGAAGATATTATATAATGGTGTGGTGTGCT 

SEQIDN0138 

TGAACTGAATNTGGTATCTGTATTACTCCTGTTGTAATGGCATTGGACTTATACGGCCTTGG 
SEQIDN0139 

TGTTTGTCTTCCGGCTTTGAGGGCTTTCCAGATGGTACATATGAAGGAGTTGAAACTGGATA 
TGACGAATTATTTAGGACGCCATCATCCCTCACGGAGAGCCCAGACATTCTCTGG 

SEQIDNO140 

TTGAANNCCCNTTNNGANGCACCACAAGNTNNAATCCTTNCTGTAAATGGTAGCAAAATCCT 
ACCCGATTGGGGATACGGNAGAGTTTATACTGATTTAGTTATCAATTGCACTTTCCCTATTC 
CAGTTGGNACTGAAAATGGAGGAAAACTCGTANTTCATGCCGCTACTAACGGNGGCGGNGAC 
ACTAAATTNAACACCGCCGACACTTTTNTAGGG 

SEQIDN0141 

CANAATACCCCTTNAAACGACCCGAGTCTCAAAATCGANGAGATTGGAACACTTTCGGTCAG 
TACTTGAGGAATCAAAGGCCACCAGTTTCCATATCCCAATGTTATAGCAACCATGTCCTAGA 
GTTCCTCCGATACCTGGACCAGTTCGGGAAAACTAAGGTTCACTTACAAGGTTGTATCTTTT 
ATGGACAGCCCGAGCCGCCAGCTCCCTGTACTTGTCCG 
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SEQIDN0142 

CTNGTGGNANTAAATAACATTCTCATTTCTTTCCNNTNTTNTTCAGGTCCAGTACCACCAAT 

GGAGNCTTTCCCTATNAN 

TAATTCATATAGTCG 

SEQIDN0143 

GNGATGTCTNCNTATTGCACATCTGNTGTTGATTCACTTTATGGGAATGCTCAGAAATTTCA 
AGCAATCGAGACAGATAATCGCAGGCAACGAGCAGCTCTAGTGACCTTACAGGAAAAGGTAG 
ATGCTGTTGCTTACCCAAGAGGAACTCTGGGTGAAAAATACGTGCATACTTCCG 

SEQIDN0144 

GAATGGAAAAAGTGAAAGTGAATTGCTTGACGGATGCCGAGTGCTACTATACATGGCTACTT 
GTAACCACCCTGCTAGTATTTGATGAACTGCTGAAGTCTATAAGGCCGTATGGCACTCTTAG 
TTTGTTGTCCGCGCTGATGTGTGAAGTCA 

SEQIDN0145 

GTTCCTTTGCCATTTCAGCTGCCGGCGGCAGTATCTTCACCTGGGACGACGTCTTTCAACTC 
CCCGAATCTCCTCAAAATGACTCTTCTACCCTCTCAGCTTTCTTCGATAAAATCAAGCTCTG 
TAATCGCAATTCGGAGAAGCAATCCGAGTTCATGCCTTTCGTGATTGAAGACCAAATAATNG 
GATACGTACACCACGTGGTTGCTGA 

SEQIDN014 6 

AGANTTTAACTCNATCCATTACTGTANATGGNATGCAAAATCCTACCCGATTGGGGATACGG 
NAGAGTTNATACTGNTTTATGCTATCAATTGCACTTTNCCTATTCCAGNNGGNACTGANAAT 
GGAGGAAAACTCGTANTTCATGCCGCTACTAACGGNNGCGGAGACACTAAATTCAACACCGC 
CGACACTTTCGTAGGGTTACTCAGGACTCATNAAGAAGG 

SEQIDN0147 

TTTNCACACTAATTCCCCTNTATCTACNAATGAAGTGCGTGGGCTTGCAGTACCTGGAGGCC 
ATTCGTAAGCTCAAGGCTTCTGGCTTCCAACCAACGCGCACTGTCTACCTTTCCTTCGTCCC 
CGACGAGGAAATCGGCGGNGNCGATGGAGCCGGAAAGTTTGTCGATTCCGATGTCTTCGTGA 
AGATGAATGTTGGGATTGTACTTGACGAGGGCTTGCCTTCTCCCACCGAAAACTATCGTGCA 
TTCTATGGGGAGAGGTCCCCCTGGTGGCTGGTCG 

SEQIDN0148 

GNTCCGTAAAGTCCCCAGNNTNCNCGACCCGTNACTCNGGAGTTACAGCGANACANGTGGCT 
GNATNATNNGACATACTCAGACCTANTTAGCTTTGATATAATCCGTGAGGGTAANTTCGTTC 
TTTGCAANCAAATGGACGAACCTGGTATGTTTAGCCTAATTGCAAACAGGTTTGCTGATGCT 
TTTATTTCATGGGTTTCAATTTGTACTAAAGCTCACTTGCCGTTCTTCATGTACTAGAAAAC 
TACATATGTCTATGACCCTTTACCTAGTCTGGTAATTTCAAGGCATGAGATTGNGATTGATC 
AAAACAAGTTGGAG 

SEQIDN014 9 

GGTAACTCCGATATCATCGAGAGCCGATACGCACTCACAAAGCGGCAAGGTGCTCGCTATGT 
GCCTGCTGCTTTCTTGACTGGTTTGCTTGACCCGGTAAAGTCCAGGGANGAATTTGTCCAAC 
TATTTGCTGAGTTAGAGGGTAGGATACCAGTTCTAGTTCTGGCAACAGCAGGTTCTCCGAAG 
AGGTCAAAAGCAGAGATGGAAGCACTTATGGAGGCCAAAGGGGTGAGCAAGTATATCGAAGT 

FIGURE 4 (continued) 

BNSDOCIO: <WO ' 030851 1 5A2 J_> 



WO 03/085115 



77/140 



PCT/EP03/03703 



GCCAGGTGCTCTCCTTCCCCAGGAAGAGTATCCTGAAATAGTTGCAGAACAGCTTTACAGGN 
TTCTGCAAGAGAAGTTTGAGCTTNAGGC 

SEQIDNO150 

TTTTNNCACTTCTAAAACCCTCGTNTGANNCTGCNAGGCATGTGAAGNTGTCAAACTCAAAC 
CTTATGCCANAAAGTGCAAGAAACTGATCTTTGAATATGCGCCAGTGATTCTCGTAAATGCT 
GAACAGTTTCTGGAAAAAAATGACGTATGTGCTATTCTTCATGATTGAGAGCCTGCAGCAGA 
TAAAGAGCTACAAGCATCACCAAAGATGCAAGCTTCATTGCATTCGGCCTC 

SEQIDN0151 

CTGNCCCTATCCGATCCAATAGTTGACTCAAAGGTGTTGCCTATTCCAGCCGGAGATTTGAG 
TTTTGGTTCGGGTGCACAACTGAAAAANTCAGTTGGNAATTGGTCAAGATGTCTTACTGATT 
TGTTTGGCATAGATGCTGAAGATTCCGGNCAAAATGATGAAGGCAGCTTCGGAGATGATCAN 
AGGAAAGGTGGAAATCAACCAGAGCATTTCCATCTTCTCAATGCC 

SEQIDN0152 

AATCCTGTTCAATACCAAAATCAAGCAACAAACGGTTGGCAAGCTTCTTGGAAAAAGTTTTT 
GATAGATATGATGTAC 

SEQIDN0153 

TCAACTGAGAGGTGTGGGAAGAAATGAAGAATTGTTGATGGCTTATTTTGCAGAAAGCCTTA 
TGGGAGTAGCTCCGAATGGTTTATGGATCAAGACACGTCTCGCTGGTATGTCTGGGATGACA 
TGGCACAGGCCTTTGTCAAACGGTTCCAATACAACATCGACATTGCCCAGACCACATTTCCC 
TTTCAAACCTGAAGAAGAAACCAAGTGAAAGTTTCAGGGAATATGCCA 

SEQIDN0154 

CTGGGTGAAAAAGCTCTCCTTNTGCCTTNCCAGAAGCACCTAGCGCAACATGTAATGGATAG 
ATGTGCTCGGNCCATGGATGTGCAACTTTTGCATGCGGAGCCTTCATGTCATAGTTA 

SEQIDN0155 

CGCTGCTGTGNGTTGAATTTTCTCCCATTTTTTGGAGAGGTGTGTATCTGGA 
SEQIDN0156 

AATGGAGAATGGAAAAAGTGAAAGTGAATTGCTTGACGGATGCCGAGTGCTACTATACATGG 
CTACTGTAACCACCCTGC 

TAGTATTTGATGAACTGCTGAAGTCTATANGGCCGTATGGCACTCTTAGTTTGTTGTCCGNG 
CTGATGTGTGAAGTNA 

SEQIDN0157 

GNTATGTTGCTGATCAATCTNGTTATGGCATGGTTGATCCTTCTCAGCATTATTATCCGGAG 
CAACCATCCAAGCCGCAGCCAAGCATTTCGAACAGTCCTTATGCTGAGAATTATCAACAGCC 
ATTTGGTTCTTCATACAGTAGCGGCT 

SEQIDN0158 

TACCCAAAAATAAAAGTACCATCCTGATGCATCCTAATGTGCTACATATTGCAATCTTCATG 
GGTAAAAGAGGTCATTTGGCGGACCAATGAGGT 

SEQIDN0159 

ACGGGGCCTCNAGGCTAATAAACAAACAGAAAATGAAAATTCTTTTGAGAAAGAGTTGCTAA 
AAATGCAAGAAAAACTTCAAAAGATGACACTTGAGAAGGAGCAGACTGAGGAAATGTTGAAA 
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GCTAGAGAGGATATGTTGAAGCAGAAGGAGGAAGAGCTCGAAGCTCGGGATAAAGAGCANGA 
AAAGCTTCAAATTGAACTCAAAAAGTNGCAGAANATGAAAGAGT 

SEQIDNO160 

TTTTNNGANGTACTAANNCNCATTNTAGCCGACCGCACTCACAAAGCGGCAAGGCGCTCGCT 
ATGNGCCTGCTGCTTTCTTGACTGGTTTGCTTGACCCGGTAAAGTCCAGGGAAGAATTTGTC 
CAACTATTNGCTGAGTTAGAGGGTAGGATACCAGTTCTAGTTCTGGCAACAGCAGGTTCTCC 
GAAGAGGTCAAAAGCAGANATGGAAGCACTTATGGAGGCCAAAGGGGTGAGCAAGTATATCG 
AAGTGCCAGGTGCTCTCCTTCCCCAGGAAGAGTATCCTGAAATAGTTGCAGAACAGCTTTAC 
AGGTTTCTGCAAGAGAAGTTNGAGCTTNAGGC 

SEQIDN0161 

TAATCNCACNAATNGAGGCCCTATGCAAATCTCNTTCAAGTGGAGTTTGCTTCATACTTGCA 
TTATTGTCACTNT 

SEQIDN0162 

TTATTGGGTAATTCCCATCTACTGGGTCTTTCTCAAGATTTNTCAACTGCATCTGTAGCCAG 
T G AT AAC AAG C AG GAT G 

SEQIDN0163 

AAGTAAACTCNCGGTAGGGAAACTACNNCGATGAAGGTCTTCAGTCAGCTGAACTACTTGGA 
TGTGCTCATATCCGTTNGAATGAGCTTGAGCCTGGTAAAGTAAAGGNTATTTGG 

SEQIDN0164 

GNCGCGANTTCCTTCGTCCAAGACTGANGCTTTNTANTTCAAACAGGTAGTTCAAATGCTTA 
CTGGGTCCTCTGAAACCGCCAAGGTTGCAGCTACTCCGGGTCGGGCTGAGCCNGTTAGACAT 
CNTATCCCGCCCA 

SEQIDN0165 

TGCTGTGANTCTTTTGCTACATATGCCTACGATTTACGAATNTGCAAATAATGTTGGATTTT 
CCAGGCATCTTCAGAATTGCCTATGGTGCAAATTGCTTTTCAGTATACTGTTGTTGNCCCAC 
CAGATGAACTTGCAAATGCAGGATCAAGTTCTACAACAAGAACAAAGCATTCCCTCAAAAGG 
AGA 

SEQIDN0166 

GCACNTGTCGAAAATCAGGATTGATGTCAATGCTGATCAGCACCCCTTTCAGTACAAAANCT 
AAATCAACCACAGAAGCCAGCTAAGGTGGACNTGAANTCCGCAGTTTATCCTGGCGGTCCAC 
CTTCACCGGCAAGGGCGCCAAAGATGTCGCACTTTGTCGATACAACAGAAATGGTAAGAGGA 
CCTGAGGAGTCACCTGGCTACTGGGTGGTAACTGGTGCAAAGCTATGTGTAGAANATNGTAG 
GATAAGAATGAAAGTGAAGTACTCGCTC 

SEQIDN0167 

CCTTTTNAGGCCACGNNTNGGAGCAGCAAACACAGCAAATNTAGCGATGAGANAGCCAGTAT 
TCATAAAAGTGGATCAGTTGAAACCGGGAACAAGTGGTCACAATCTGACGG 

SEQIDN0168 

TTTTNCTTTGGATACATGGCTTGCATCTGCTATGGTTTCTTCCTCATGCTAGGGACGTGTTG 
GTTTCCGCTCATCCATGCTCTTTGTTCGTCACATTTATCGATCTATCAAGTGCGAGTAAACA 
TTTTGTGTAGTATTTGTTTCCTCCACTTTAGCCTCTCTACTTCTTCGGGGTAGATGAAAAGT 
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CTGCGTACACACTACCTCTCCAGACCCCATTAGTGGGATTTTACTGGATTGTTGTTGTTGTT 
TACACCACTTTGGTTATACC 



SEQIDN0169 

CTCCTACTCCTCAGTGTTTCTCAGCCAGCCGTGGAACTACAAAGGCCACTCCATCTAAGGCA 
AAGTATAGACCTCTGGAGACAAGGGGTATCCTTCAAGAACTGGAACAGAGCAGCAATGAAGA 
GAAGAGAAAGGAAGATCAAGGGAAGATGATGAGTAATAATCAACAAGGACAGAGAGGTGGTG 
CTATTGTTGCTGAAAAAGAAGCTGCTGCTAGAGCTTTGGATGTCTTCTGGTTCTTGAAACCT 
TGCACTCTTTCCAGCTGAAATGGTCAAAGCCCACTGCTGCAGAACATTTCATGAAGTGATTC 

TTTCATAC 
SEQIDNO170 

GTTCCCTCCTACCAAACTTGAGGAAATCAAGTCTATGCACAGCCCACAGTTAGCACAAAGGG 
CTTACAGCCAAGAGTCAATGTACTAAGAGGAGAAAGGAGCCCCAGATTGACGGGCAAGGGAC 
TTGAAATAAAGCAAACTCCTAGCCCCCAGCCATCTAATCTGGGTCAAAATGGTCGTGGTCCG 

TCTTCTACCTAGTA 
SEQIDN0171 

NTCNAGANTGATCGANCAGANGGTGCNGATGATACTTTGGNAAGGCCTAGTGAAGAGGNCAA 
CTCCAGATAATGAATCANAGTTTCAGGTGGAACAAGAGAGAGAGCANNTAGCNGCGGACGAA 
AGGGAAGAGGGAGAGCTAATTGCTGATCCTGAAGATGTTGGAAATNTCGAGGGAGTNAGCAA 

TTTA 

SEQIDN0172 

TAAGGNCANNCNAAGNCCANCAGTGCCATNACGCNNATTGCCTGACTGTTCANTGCCTACAN 
TNTGCNGTANTTCTAATGGCGANCTGAAAATGGCCAAGGNCCCCNAAACCTAGAGCTNTGTC 
AGTAGANTNGGGTNTATATTTGAATTNGATNCTGTTGAGTGATAANGATGGTGGACNCNTTG 
TACCTNTACCTGANTGCAAATAANGTNTTGTCATCAACAGANNTTATGCTA 

SEQIDN017 3 

GTTAAGNGNGGCNANGAGGAGGCTGTTTCNATGCAGNNNGTCTGGNCTATCNNGTNNTNTNT 
AGGNNNANATCCTANGNCTCACCTGGNTCTCTTTAACCCTGAGNATCATTNCACCACTTTNA 
CTCAATNTNCTCAGNCCCANCTNTTCCTNTCAAAATTCGAAATTATTGTNCCCATAGTATAT 
ACTCTGTTTCTGGTCCTCCTTTTCCTCTGTGCTGTAGCCACAATTACATACAGCACGCTTCA 
TGTATCCTATGGTAGACCTATCAACCTCGTTTCCTCTA 

SEQIDN017 4 

GTCGNGCAAAAGAAGTTGTGGCTCACAAGTGGAAGCATCAGAGATACAGAATAGACAGTGNA 
GTTTGAACACTTNTTCCTGATTTATTTTCTCTCTGCCTTTAGGGA 

SEQIDN0175 

ATGAGAAGCCAAGGAGT.CCCTAGTCTTNCGNGCTTGTTACCTGGGGCCCAAAGGAGCAACTA 
GTGACTCGNA 

SEQIDN017 6 

NNTCATCACCTATGCCACTACTCTTCTTCTGGAGAGGCGTGGGAAAGAGATTGTCTTGAAAG 
CANTGGGCACAGGCAATTAGCAAAACAGTTGCTATAGCAGANATCA 

SEQIDN0177 
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TTCGGCCAAGCTGACGCTTCCTCTTTCAAACAGGTAGATCAAATGCTTACTGGGNCCTCTGA 
AACCGCCAAGGTTGCAGNTANTCCGGGTCGNGCTGAGCCNGTTAGACATCCTATCCCGCCCA 



SEQIDN0178 

CGTNGNCCAACTCANGGCTGAACAAGTGATGAAGNGCCCTTCACGGGTTCCTCTGNAAGAAC 
CAGNGGCAGTTGGTGGTAAACATATGTCAAAGTCTNCAAGTATGAANGGAATCATCACCCCT 
GCGCCAAGGTTGAGNTTCTCCCCTTCCTTACCTATCACCCGAGCATCGGNTTCTNCCTCAAA 
GNCTTCTACGCAGCCCTCGTCTCGTCCTTCA 

SEQIDN017 9 

CGNGTGGAAATCGCCCGTAACTGACCTGACATTTCCGGAGTTTACTGAGGAAGAGTCAACGT 
GGGACATGTGTTCGGATAA 

SEQIDNO180 

NTACTGGTACTNGAGCGGGGGAATTTTTCGATTATCTGCTGGGCTGATTCCATTCAGTAATG 
CTTTCCGGGAATGTATAATAAAATGCCGAGTTGGTGACTGAGAAGAAACCTTGTAAATAAAA 
TCACATAGTTCGTTGNANGAGTCGTGGTCATCAACTCCAATTCTGCATTTCACNCTCACTGG 
AACATTTGTATTTGCAGCAATTACNCACATGGCCTCAGCAACAAACTTTGGATCAAGCATGA 
GTCGCACACCAAAACACCCATGTCCAGCTACTTTAGGGCTAGGACATCCGCAA 

SEQIDN0181 

AGTGCCCATTTTNTCAGGTTGNANAATGAGCACCTTGNAATAACGTCGGGAAATATCGTGCC 
TTCAAGTCCGTACCATTGCCTT 

SEQIDN0182 

ATTCCCGGTTTAACCTCCAGTATCCTGTTTTNCTGATGAAGACATGCTAGAGGTCCCAACAT 
ATGCTTTAGAAGGCTAGAAACTTGTGAAGATAGCAGATGGTCATAATTGTAAAACTTGGGTG 
TCATGAAAATATACGTATCACGACAATGACTGGTGGCTGCAAAGTTGAATGTGTTGCTGATG 
GTGACTGTATAGTTAGTTGAATGTGTTGCTGATGGTGGCAAGAGAGGGTTTATACTTTTGGT 

TGTGTT 
SEQIDN0183 

CCNNNGNAAATCCCATACANATNTTGCNCTAAACTTNCTCACCGAAACAAACCTGGATGTTC 
TCTGNGAACTGGNTGATNTNC 

SEQIDN0184 

CNTTTGGAACCNCCCTCAAGCTGAAAAAGAGGANGGGATTATACCTCTTACCGACACATTGC 
CAATTATGCCACAGGNTTTATGTCACCATGCCC 

SEQIDN0185 

NAGCGCTTGGTCTTNCCTCACTGCTTNTGGTGNTNAATCTTGGNNTNTGTNCCNAGNCGTCN 
TCNNTAACATGAAGCNTANGGTNANAAAGGNAGAAAAGTNTNCTGCTNGAAGGGCTATCTTG 
CCCNTGCTTCANGCTGAAGAGGATGAAAGATTCGT 

SEQIDN0186 

GCTTAGANNTCTTGAAGGTGACCTGATAATGGAATCTGGCCAATTGNCTACAACACCTAGGT 
ATGATGTGGGAAGCCAAAGTGGAAGGATTTTGTCTGATCACTACATNCAGCNTCATAGGTAC 

AGNGNCTCNATACTAAANGATGGGTTGGAGGGA 
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SEQIDN0187 

CTTGGATGGTCNACCAGATTGAAGAACNCGAGAAAAAGCTGTTTTCTCATCCACTTCATAAG 
TCACAAAATGAACANCAGC 

SEQIDNOI88 

GANAGCCCATGCTGNTNTANNANAAGCTGCNGAGGCTGACTTGGNGGNNCTGTACNNAGGAT 
ACCAANNTGTTNTTGCTNATNCACGCTANAAGGGNGACTATTANGCCTAAGGATATTCAGNT 
GGCNAGGCGTATTACGGGAGAAAGGGC 

SEQIDN0189 

CGTTAGTACGCNCATCTGATAATGACNTTGAGAGTGCAAAATGCATGAGCTTGTTGTCAAAT 
AATCATCATCATTTCAGTACAAGGCCAAAGGCTGATTTTACAAATCAGGTCATCCGCAAAAT 
GAGGGACTTACCAGCTGCAAAGTTATCTCAGTTGC 

SEQIDNO190 

ACCTTGGAATTNCTTACTGAATGACTAAATGACTTGCTCGAAGGACGAGGTGGTGGCATTAG 
AGCACGGGCAGATATTTTGCCCCTTCTTGATTTTTCATCGCTGGACATGCTAGAAGCTGCTT 
TCACTGATGTACTAGAAGCTTGTTCTCTTGCACTTTTATCAGTCAACTCGTCATCAGCATCA 
TCACTTGAGCTTCCACTTTCAAAGAAGTCATCGCTGC 

SEQIDN0191 

ACACCNATTTCAACTACCNNGAGCTGACAGCNTACNCAATACACCGCAGANGCTTTTTGCTT 
ACTCCAAAGTTGCTCCCTAATCTGGAATACAGCGAAGCATGCAGCATTTTGACTGTAATGAA 
TGGTCCxrGGATTGAGCAGCCATCCAAAGCTTGGAGTGGTGGAGAATGCTGGAGGTCTGTGA 
TGGNACTAATGGNGGAACGTTGTCGCTGACTAGAAGCAGAAATACTTTAGGTTGATTCGAAC 
AGGATTTGTCCATAAGAAAAATTTGCTTCCTTGTTGATCTGCCATTGCGCTAGTTACAAGCT 
G7\ATCATGGTCGCTCANCTATGTTTTTTGAAAAATCTGTTATTACTGGCCCTTGTTCTAAAA 
TAACATAATTCTTTTGTACGC 

SEQIDN0192 

CGTTTGNNTCACGTTCTAATGTACGNTNACTTCATTGGAAACANTCCTACACATTCAGAGAC 
GAAACTGGCANACTCTTACCTCTATGACAAAGCTACATGNATTCTTGCTNGGAAACTCTTCC 
TCCCGGAAACAGATTTCAATCTGGACCATCTAGCTGCAAATCCTCTTGTACCAGAAAAAGAT 
ANACTCTTGGAC 

SEQIDN0193 

CCTTGGAATTGCCTCCATTGNTGCGGGCCAGGCGTTGCTGATGTATGGAACGAGTTGCTATT 
TCAATTTTTGTGANAAGAACAGGTGCACTTTGTAAGTAATCTTTNCACTATCATTGGAGAAA 
AGAAAAGTTCACACCTTGAACGTAATGTACATCTCGAATGAGCACGCCCATGTTTCTACTGT. 
TAGTA 

SEQIDN0194 

CCTTTCTAGTTTGCATCACNTGCATTTGACTTTGGGGACTCAACACAGGGGTTGGGTCCGTC 
TAGGACANGTGTACCCAAA 

SEQIDN0195 

CCTTNNACATTTTCTGGTTAGCCTCTGGTTTGTTTTTGATGTTTTTAGCACCGGTGTGCATA 
ATCCAGTGTGC 
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SEQIDN0196 

GTNNGNGAGCNTGGCNNAGGATGCAGACTACCAAAGCTCNNAANNAAGCTTCTNTGAACACT 
CTCNTAATAGGTNAGATGTAATGGTCTTTCAGAATGGCCAACACAGCAATANTGCCTTGAGT 
GTCCCAAATCACTGATGCCATGATGTGGGCTCCTAGACTGNCCTGACTNTNGCTTTGACTGT 
GCCANTGCCACCTCNTGGTAGCCNTNAGTTTTCATGATGCTTTGCCTTGGAGATCATATTGN 
CGAAGCCATGTTCCAACTGCCGCTTACAATTTGTCAGAGGGATNCGTCAGGATCGNGAACCC 
TCCTGT 

SEQIDN0197 

TGCGGANCCNGGGCATGNCAGCACGCCNNAGGACCATGCNAGCAGNATNACTGCTNCAGNAG 
ATNGNNATGANGNGNNTGNCNNNATGTTTGTTAGTCCNGCGTGTCTTTTAATANATCATCNN 
TNACCGCNTAGGTTNGNNCANCACTGNCGAGGCTNTTANGTNANNTAAGAGTCTGTCNTNGT 
CTAG 

SEQIDN0198 

AGTGC7\ATGAGTATAGCTATTGAGGCTATTGAATCTGGCGCTGTCAATGCTGCTTCTGTCCT 
TGAGCAAATTGAGCTCC7\AATAGCTCAAGCTAAAGAGGAAGCTTTTAGCAGAAAAGACATTC 
TAGACAAAGTCGAGAAATGGATTGCTGCTTGTGAGGAGGAGTGTTGGGTTGAGGAGTATAAC 
AGGGATGAAAATCGCTATAATGCTGGACGAGGCACCCACCTTACCCTGAAGCGTGCTGAGAA 
AGCTC 

SEQIDN0199 

CAAAGTACGANAGCGATGGNTCTATCTCTCAGCTGCAGGCAATGCCAAGTCGCCTTGACTTC 
ACCACTGAATTCCTCTCTCTAGCTGCTCATGAAGCTATTGTCTGTCGTTGTCATCCAGTTAC 
TGTTGCTTCTCTGTCACTTCTCTTGAACTTCTATCCCACAGGGAAACAGATGCCAACAACTG 
AGGTTGTAGTTTTCAGGACTTTAGTCACGACTCTATCTCAGGGTCCTCAGAATGATTCTGAT 
ATCCTAAAGCT^ATGAAACGAGCTCACACTCGGCTATCTGAGCTAGGTGCTGACAAATTTTT 
TGGGAAAGGTGAGATTGGGAGACGGGAAAGGAACTGGTTTTCAGTGAATGCATGGAATTCCG 
GTG 

SEQIDNO200 

CTGGGCNGACATNGCCTANTGNGGNNTTTCTGAGGNNTCCNATNGACATGATTGTGGGGACT 
CCAGGCAGGGTTCTACAACATATTGAAGAGGGAAACGTGGTTTATGGTGACATCAGATACTT 
GGTCTTGGATGAGGCTGATACCATGTTTGATCGCGGTTTTGGTCCTGATATACGAAAATTTC 
TTGCACCNCTGAAAAACCGTGCTTCGAAGCCTGGTGATGAAGGATTTCAAACNGTGTTGGTG 
ACAGCAACAATGAC7\AAGGCAGTTCAAAAGCTGGTTGACGAGGAGTTTCAAGGGATTCAGCA 
TTTACGTACTTCTACATTACATAAGAAGATTGCTTCTGCTCGTCATGATTTCATCAAACTTT 
CAGGTTCTGAGAACAAGCTGGAGGCGTTGCTACAGGTTCTTGAGCCAAGNTTAGCAAAGGGC 
AATAGAGTGATGGTATTCTGTAACACGTTGAATTCCAGTCGTGCTGTGGATCACTTTCTCAG 

TGAAAACCAATTTTCTACTG 
SEQIDNO201 

TTTNNCAACCTTTGGTATTGNGCTCACTTTTTNCTATGGCAGNCTTGGCTTCTTGGTGGCTG 
GCAAGAACGGAAAGAATGGTCAANNCTTCATCAACCATGCAATTGCTTGAATCTGTCAGCAT 

CG 
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SEQIDNO202 

ANGTTGGCCAAGGCACTTGCGAAATTCTTTGGTGCCAGGCTACTGATAGTTGATTCTCTCTT 
ATTACCTGGNGGATCAACTGCCAAAGACATTGACTCTGTAAAGGAAAGTTCTAAACCTGAGA 
GAGCAAGTACTTTCGCTAAACGTGCTGCTCAGGTGGCTGCACTACATC 

SEQIDNO203 

CACTTGCTACTCTAGCAACTGAAGGATTGGTTTCTGTTCATGGTGACGCTGTGAAGAGAATA 
TGATGAGTCTAAATTAGGAGTGAGGCATTCTCAAATTCATTGCTCAGGGAGCAGAAGTTGAT 
ATGTGGATTGCTACTATTTGCAAGAGCACTTTGCGGGCATGTTAGGCAAAGTCATGTTTTTT 
TGTTCCTGATCAGCATTCTTCACTATCTGCCCTTTGAAACAGTTAGCCATAC 

SEQIDNO204 

NGACCGCNGCGATNCTAGAATCAGTTGANANTTGNNGNTNGGACATGGNATNTCTNGCNCNT 
GAAGCNTTTTTGTCATCGACGATNATGAATTTCTACATCCGGTNCCTCCAGAAGACAAAAGC 
TCATGCAGANGTAGATGAGTATCACTTTTTCAATNCATATTTCTACAAGAAGCTCNAAGAGG 
CTGTACTGAGCAAGAAAGGAATNGCANATGCTTTGNTGGAATATCAACTNCCAACAGCTACC 
TGAAAGCAAAGTGCC 

SEQIDNO205 

TCAACCAAGNTGTTCCTATTGGTTCAAGCTTCTTCTTCAATCAACTTGCCTTTGCATTTTCT 
TCCAAAGAGGNATTTTAGGTACAGNAAAAAGATAGTGCCTTTGAAATTGCGGTTTAGTTGTA 
AAAGCAAAAATCTTGAAAGAGAAGCATGTGCTGATGATACTCAAACTGCTAAAGCAATCACA 
TCGCATAGTTCTAAACTCGAAGACGTTATCTGGTTTCAGTGTCGGCATATTATCAAGGGCTT 
GGCTTCCAGGATTTCTCCAACTGAAGAGTGAGGATGGCTTCCTTCTTCC 

SEQIDNO206 

NCAATGACAGTGCTTGCTCCAGCTTCCCTTACTTCCCCACCTTCTGTGGTGTCATTGAGCAC 
TTCACCAACATCACCAATGAGTCCCTTCATTGGTTCTTCTGATTTCACAGAGAGAGTGAGTA 
TCGATAAGCAAATAACTGCTGCTCAGAGCAATAGCTTGGTATCAG 

SEQIDNO207 

TTGCTTCCTTGATAGNGCNTGACAGANGCCTNANGACCNTCCANTCAGGATNACTCATTCNG 
GAGGTTGCNTGAGGAGATTTTNTTTATGTTTTTAGACTGCTGCNCTTTTTTATATCATCNTT 
TACCGACTAGCTTTGNACAGANNNGNCNAGGCTTTNAGGGGANGGTAGAGTTTGTCATAGTC 
TAG 

SEQIDNO208 

AGATGATAAAAGTCTGGACGAGGNTGGGGACCCGACGCTGCCCATCTTAGAGGACGGCGTAC 
C AAC T G AG AAT AAG AAT AT AAC T AAC T C AC AC CTTNCTCTG C AAAT T C C AG AC C T C 

SEQIDNO209 

CNNNNCACTCNNGAAATACTTTNNCGCCNGGCTACTGATANTTGATTCTCTCTTATTACCNG 
NTNTTTCAACTGCCAAAGACATTGACTCTGNAAAGGAAAGTTCTAAACCTGAGAGAGCAAGT 
ACTTTCGCTAAACGTGCTGCTCAGGTGGCTGCACTACATC 

SEQIDNO210 

TTTTGANGGCCTTAAGCTACATCNGAGGAATGAAGAGTATTATCGGTTGATTTCACCACACA 
TATAATGCGCACTGGACCTACATAATCCTGGAAACTGTAATGTCAAGGNGGTTGAAGCGGAA 
TCTTCACAATCATTCAAAGGAAGAATCACAATGCTGCCTATCAGTTTTNATGATGC 
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SEQIDN0211 

CGCGTNNGTCTTAACGGCTGGTCGGCTGGCATANCGGTNATACGGNTTATTNTGCCAGTAAG 
TTTGGCCTCAAAGGACTGGCAGAAGCATTGCAGCAGGAGGTTATTGGCGAAAATATTCACGT 
ATCACTAATATTTCCCCCGGACACTGAAACTCCTGGATTTGCTGAAGAGAACAAAAGAAGGC 
CACGGGTGACTAGTATAATAGCAGCCTCTTCTGGTGCCATGAAAGCTGACGAAGTTGCCAAG 

ATAGCTTTGAATGGCA 
SEQIDN0212 

GGCANTTNTTTTCTATTACTTCCCAGCCTTGGGTGGAATGGAGTATGTCTTACATCACGGGC 
TCTCNATGTTTGCAATTGTTCAATCCCTT 



SEQIDN0213 
GNTGGAATGCC 

GATGAGAAAAGATCATATTTTGTGCAAAGCAATCAACCAAACTTAGTTGCTGTTGCTGTAGT 



GNTGGAATGCCNAAAGAAGCANNCGACCCCTGTCCTATCAGTGTCTATTCTTCAGTTTGCTA 



SEQIDN0214 

CTTCTTATCTTCTACAAATTGGCTCAATATTTTTAGTCTTACNATTTTATCTTTTTNTAATT 
T T N AT AAAGAN AT AT AAT TN AT T T G AGNGA 

SEQIDN0215 

TTCNGTTACTGGGGNTGATGATTTGACAAACCCGAAGTGGTATGTGGTGTGGTCTGCAAATA 
TGAACACTCACATTCTTCCCGAATGCGTAGTTAGCTACAAATATGGACGTCATATGTCAGGT 
CAAGCAAATNGTGCTTCATCCATGAAGTGGGCTCCTCATGCTTCAAATGCAATGGGTACA 

SEQIDN0216 

GNAGCAATTATCCTTCCTTTTCTCTTTTCAATTATTTTTCGTAAGGGTAGTTGCTGATAGGT 
TTGGAGGTCCAATGGCGATTGGCATGAAGCAGATGTCCATAATAATAGCAACCCTTGGTGTT 
TTATCCTTTGTATTTGGAGTTATTGCTGAAAACAAGAAGCCTGCANCTGGGACTGCAATACC 
AGGAAAAGGCGTTGTTATTTGTAAATACAAGTCTGACCCTACTGTTGCCTTGGGCTATTTGT 
CTTTTGCTCTTCTTGTTGCATCTTCTGTGGCCGGTTTCCTGTCGTTATTTTATCCGTATCAA 
GGGAAGTCAATCCCACAAGCTGCTTTGCTCAAAAACACTANTTTTGNTGTGTTCCTCAACAT 

CGCATTGGGCACAACTGGTTTAGCAGCAGCA 
SEQIDN0217 

AATGAATATCTCCATGACTAGAAAATTGTAGACATGACACATTCTTTTCTTCTGCTTTGCAG 
GCTCAGTGAAGCTTTCATTTGGCTAGATTCGTGGATTTTGTTATCAGT 

SEQIDN0218 

CAAAGAAGAAGATGGGTCGTCATCATCATCATCATAATGAAGGAAATAGACCATATGATGAT 
CCATTCTTGGCATGTTGTTGTTGTCCTTGTTTTGTAGTTTCTTCTACTTTCTCTGTG 

SEQIDN0219 

GNAGGTTCATATCTCAAGGTCAAAGAAGAAGATGGGTCGTCATCATCATCATCATAATGAAG 
GAAATAGACCATATGATGATCCATTCTTGGCATGTTGTTGNTGTCCTTGTTTTGTAGTTTCT 

TCTACTTTCTCTGTG 
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SEQIDNO220 

GGAAATACCCCACGCTTCAGCAGATCAAGGAANCCTTCCTGAATCAGCTACTTCTGCTGCAC 
ATGAGGCATCTATTCGCGAATTTGCNGAGGCTGTCCGTGCTTATCGAGCTATTTTCCATGAT 
TCAGAACAGCAACTCTCTAGACTTGCACAAAATGTACCTAAAATGCATTTCGAAGCNGCCCA 
GCAGCACATCAAGAAACGACTTGCTTCTTCCAATCTTGTTGCCATG 

SEQIDN0221 

GAAAAGCAAAAATAAGNCGACATGGGGTTTATTTGGAAGAAGAAGATTGGGACTTTGAATAA 
TAAG^AGCGTAAGTGGGAGGAGAATAGATCACGGCGGCAGCCATAGCTCCAGGGAACACTAC 
ATATTTAGCTATCAGCGCCACCTTCCTCACCAATGGCTNTCCCTTCATCAACGCCATTTGC 

SEQIDN02 22 

NCTNNAAAGGCCACTCAAGTTCCTGATATAAAANGACATAGTATTTGCACATCACCTTTGCA 
CATCC^TCCATTTTCTTCATATCAACTCTAGATCACTCATAATATCCGATGTAAATGTCATG 
CAAATAGTTATTGTACTATATTGTGTAAGGAATAAGGACAAGAAAAAAGTCTGTACATGTTC 
AGTACAGACGCAATTTTTTTTTCCAATATTTCCAATCCTTGGTTGCC 

SEQIDN0223 

TTTGCANACAGTCCCNTACTTTCCCGACTACTTNCAATAGGATNCTGAAGATGCCTTTGATT 
CATCTTCTGACAT iCTAGCCTATCCGTGGNCAATCAACACAAAGTATTATAATGCTGATGGT 
TCTATTTGGATGGCCCATCTTNATGAGGACTTNTCCATTGGAGCTTTGCCAGCATTTGACCA 
CCTTATTGCATTGGTGTTGGTCTTCGATATTAGTGATCTCTCATCTTTTGCTGCGCTGAAAG 

ATTGGGTTTCTCGCACCGACATCC 
SEQIDN022 4 

ACCCAAATCCTCCTCAACCTGAACAATGTTGAAGAGAGCCAACCTCTACCAGGTTGTAGTTT 
GGTTGCAAATGAAAGAACTCCTATGAAGCTCCTGTCTGAAAGTGAAGTAATGCTTGAAACAC 
CTGCTCAGCCCACACCAAAGAGATCGGTGCCAATTACCGAAAATAAGTACAAGAGTATGACA 
TGCCAAAACTCTGTTGTTTCCAATCTAATTGTCAAAAGGTCATTGGATTTTTCCACCTTGGG 
TGGTGAAGAGATATCTTCAGATTTGAGTTCTGGCAGTATAGAGCATCATGAAGATGTAGATA 

ATGCCC 
SEQIDN0225 

TACCGGATGANTTTGTNGATGATGANCATTCCCAGTTTTTGNTTGCACCTGCTNTTTTGGCC 
AGNTTTCNATTGAGAGAGAGGAGACGCNAAAAGCATGCTGCTGNTTTAGNGAAACAAGATGA 
TGAGGNAACCGTNAAGCTTGAAAATGCTGCCCTTGAACGCTCTAAGTCAGTTGACTCTGCTG 
AGCTGGGGAAGTATAGCATATGGNGGAAAGAGAATGAANATGAGAATACTGATTCAAAGGNA 
CNCTTGANGCGGGACCAAATGGNTACNGNAAGGCTGTNTATAANCATNGCAACAATGAAGAA 

AAAGATNGGCTTGGCTCAAGAGTTAGAGAATCGGC ' 

SEQIDN022 6 ^ m ^ m 
TTTCTTGCNTACAGGATNTCAAGGCATTGTTCCCCTCTGATAGAAATCCCTTTTATGCTGGT 

TTTGGAANNNGAGACACCGACAAGCTCAGCTACCTCAAGGTTGGAATACCTGAAGGAAAAAT 

CTTCACCATCGATCCAAAGGGTCAAATTCTTATGAACCACCACATAGATACAAAATCATACA 

CCTACATACATGGTTGCGTCGATGACATGTTTCCACCCTTGTCCTCACGTGAGCAGATTTGT 

TATGGACAAAGTGTTACTATTACTAAGAGGAGGTATTTTCGCCGTTGATGGGTAAATGACTG 
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GAGAGTGATGCTACTGATGGATTCATCGTTGCGAATGACACACATACTGGAATTGTGATGAC 
CTGCATTTGAAGAAAATTCTGTCTCTGAACCTTTAGTAGGGGATATGGTCTCTGTATAACTG 
GTTTTGTCAAGAAATGTCCACC 

SEQIDN0227 

TATCAGNCGAATCTAATTTTGTACCCGGTGGATTGTTATGTGGTCCTCAATGNTNAAGNAAT 
ATGNNCGNTNTTGT 

SEQIDN0228 

TTATGGCTTTCACTCCAAGAAACCCTCGTGCTGGAAAGCCACCTGATCATTACATAGAATAC 
ATGCGC 

SEQIDN0229 

CTCCTATACCTGANAGGACTCTCACTTTGGAGCCTCCTCTAGGTATNTAATTCCGNCCTTTG 
NNGGGAGACTGACATCGGNTACATGCAGCGNNGCTTCAACTGTAAATATTCCACCTTNATGG 

CCTTGCCNTCGT 
SEQIDNO230 

CACCCAGAAAATNATCTATAAAGTATTATGATCCAGGACGAGCTGACTAAACTAGCTGATGA 
GGAAGATGACGAGGAAGAGGAAGGCGATGCTGAGAAGGATGTAAAAAAGCCTTCTGGCAAAG 
GTGTGAAGGCCTGAAAANACATGGGNAGANTGTNANCACANNANAGGCCCNCNACNCTATCN 
ATCAATATCCAACCTTTCTCTTCCTCGTGAATTTGTGCCTTGTGAGTTCAACCAGCTGTAAT 

CTATTC 
SEQIDN0231 

GANNCCNTTNNCTTNCTAANTAANNGCAAAAATAAGCGACATGGGGTTTATNTGGAAGAAGA 
AGANTGGGACTTTGAATAATANGAAGCGTNAGTGGGAGGAGAATAGATCACGGCGGCNGCCA 
TAGCTNCAGGGAACACTACATATTTAGCTATCAGCGCCACCTTNCTNACCAATGGCTCTCCC 

TTCATCAACGCCATTTGC 
SEQIDN0232 

GCCTTGANTTTGCGCNCCNAACAGGATGATCTTGTAGATGATGATCATTCCCAGTTTTTGGA 
TGCACCTGCTTTTTTGGCCAGAAGGCAATTGAGAGAGAGGAGACGCGAAAAGCATGCTGCTG 
CTTTAGTGAAACAAGATGATNAGGTAACCGTNAAGCTTGAAAATGCTGCCCTNGAACGCTCT 
AACTANGCTGGCTTTTTTGTGCTGNACCAGAATCCCTTATGNNGNACANNNANTGATANTGA 
GAANTCNNANTCAAAGGTACGCTTGATGCGNGACCAAATGGTTACGGCAAGGCTGTATATAA 
GCATTGCAACAATGAAGAAAAAGATTGGCTTGGCTCAAGAGTTACAGAATCNGC 

SEQIDN0233 

TTGGACTTGAGTGCTTGTAGATGGTGCCTTTGCCGATACCCACGGCATCCGCAATCATCTCG 
ACGGTGACGCTGTCTTCACCCTGGTCGAGGAACAGCTTGAGTGCGGTGTCGAGAATTTCCTG 
CTCGCGGCGGCGAAACTCACGGACCTTGCGGGGTTCTTTGTGCATAAGAAAAAGGTCTGCAG 
AGGTGGATATTGGGAGGGTGGCGTGCCCCAAGCCGTGCTTGTCGGAACGTTTTACCCGGTGC 
GCGGGATTATCCCGACTGAACGGTCGTTGGGCAACGCCTATATGAACAACCTTTGCAC 

SEQIDN0234 

TCAACCTGAACAATGTTGAAGAGAGCCAACCTCTACCAGGTTGTAGTTTGGTTGCAAATGAA 
AGAACTCCTATGAAGCTCCTGTCTGAAAGTGAAGTAATGCTTGAAACACCTGCTCAGCCCAC 
ACCAAAGAGATCGGTGCCAATTACCGAAAATAAGTACAAGAGTATGACATGCCAAAACTCTG 
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TTGTTTCCAATCTAATTGTCAAAAGGTCATTGGATTTTTCCACCTTGGGTGGTGAAGAGATA 
TCTTCAGATTTGAGTTCTGGCAGTATAGAGCATCATGAAGATGTAGATAATGCCC 



SEQIDN0235 

TCATGTGCATTTTGACTTTGGAGACTCAACACAGGGGTTGGGTCTGTCTAGGACAGGTGCAC 
CTGAAATGAAAAGACCATCTTGATGCATCCTATGTGCTACATGTTGCATTTATTCAAGGGTA 
AAAAGGTCATTTGGCGGACCAATGATAGTTGAGGGCAAGTGAAAAAATGAAAAAAATGAAAA 
AAGGGAAAAAGAGAGGGTGAAGTGTGAGGATAAAGCGAGCGGGGCCTAATTAGGTTATCTGT 

TACATTTTTGT 
SEQIDN0236 

TCACGTGCATCTGACTTTGGGGACTCAACACAGGGGTTGGGTCCGTCTAGGACAGGTGTACC 
TAAAATAACAGACCATCTTGATGCATCCTATGTGCTACATGTTGCATTTCTTCAAGGGCAAA 
AGGGTCATTTGGCGGACCAATGATAGTTGAGGGAAAATGAAAAAGAAAAGAAAANGAGGTTG 
AAGTGTAAAGATAAAGCGAGTGGGGCCCGATTATATTTTNTGTCACATTCTTG 

SEQIDN0237 

AGCAGNCANGGATNAAATGGGAAAAACNTGTCAAGCCTANCTCTACCACCAAAAGAGANTGA 
AAGATCNGACTNGAGCACACCACTNGATACATAGGTATCAGGCACATAGAAGATTAGTTACT 
GTTTGCCAACCGAAGAAATTCTTTCACTACTGATGGCAAGCATACCA 

SEQIDN0238 

AGAGGTATCCTTCTAGTGTGGATTTTGATACTGGGGTTGATGATTTGACAAACCCGAAGTGG 
TATGTGGTGTGGTCTGCAAATATGAACACTCACATTCTTCCCGAATGCGTAGTTAGCTACAA 
ATATGGACGTCATATGTCAGGTCAAGCAAATNGTGCTTCATCCATGAAGTGGGCTCCTCATG 

CTTCAAATGCAATGGGTACA 
SEQIDN0239 

GAGNGGCAAGTCCTGGCGCTCTATTTCCGAGAGNAGAAAAAAGAATTTTGTTTTTGCT 
SEQIDNO24 0 

GGGTNCAGGCNTCTGGCAANTCCTCGGTGTCANNGNTACACAACTGGAGATGGAAAGAAAGT 
TTATGTCGCCAAAAATGGGCAGGAGTTTTCTGGTCAAAGCGCATATAGATGTTACAGAAAGG 

AGACTGGAGCTGGTT 
SEQIDN0241 

GGTTCTATCTAAANNATTTAATGACTTGCAGCTGAAGGATGGTNTGTACACTAGCAAAGCTG 
AACTGCGGAAACGTATCAGGAAACTCAAAAATGGGCCAGGAAGAAATCACACTGCAGGTGGG 
AGGGTTGAAAAGTCTAAAGGTTTTTCTCCCAATAGCTTTGATCGTGTGCTCCTTGATGCTCC 

TTGCTCTGCATTAGGT 
SEQIDN0242 

TANANTACGACCGAGAACATCCTNATCAAATGCTCTCAAGCTGTTCTTCATCAAAACCTTTT 
TTCGGGTTCATCTTCAGTACCATTACAACAATCTTTGATTGAAGCTGCGGGTAATGTTGTAC 
AACGGGCTTNGAACTCGGCCCAGCCCGAACATCAGCCCAATGTGGTAAAAAATCCGAAAAAA 
CGAACCAGGGCATCAAGGAGAGCGCCAACTACTGTCCTTACTACTGACACCACAAATTTTCG 
ACAAATGGTTCAAGAATTCACTGGCATCCCTACAGCTCCGTTTACTGGTTCAGCCTACACTC 
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GCCGCCTTGATCTTTTTTCTACAGCTGGCTCAGCGATGAGGTCGGGTCATTTGGATACTCTT 
GGGCCACTTTACCCT 



SEQIDN0243 

AAGACAGGGATGGCAGTGCTGAGAGNAGGGCAAAGATTGAGCAATGGAATAGGGAAAAAGAA 
GAGGCAGAATCTGCTAAATACAATAATTTTGACACTGATAATGGCAAGAGTGATGGTGGTGA 
TCACTATGGAGAACAGTTTGATGACGATTACCCGAAGCAGCAGTAGGTAGCAAATGGAAGTT 
ATGGCTACTGATAGTAGTGTTACTCTGGGTGGAGTACAGGTCCACTGTGCTGTGATTTTGAA 

AAAAGCATAACCCTTCTATTGTCTTCTTTTTACCATGT 
SEQIDN0244 

GGATCAGGAAGGGCATGTGGCTGATGCAGGAAAAGAAACATTGACATCTGTTCAAACATCTG 

AAAT TG AAG AT TG GAC AAAAT AC AAGG AT GATG AT AT TAT GC AAC AGCAATC T T CCATC C AG 
GCTGAACAAGCTGTAAAAACTCAATTTGTTGGCGATAAGGAACCTTTGTCTTCATTAGAAGC 
TGAATACCATCTGGGAAATTCAATTTTGCTGGAGAAAATAAAGGTGCTGAGTGAACAATATG 
CTGCCCTTAGAAGAACACGTGGAGATGGAAATTGCTTTTTCCGCAGTTTCATGTTTGGTTAC 

CTTGTATGC 
SEQIDN0245 

CCGNCAAACAAAGTAAAAGATGCAGGATCAGGAAGGGCATGTGGCTGATGCAGGAAAAGAAA 
CATTGACATCTGTTCAAACATCTGAAATTGAAGATTGGACAAAATACAAGGATGATGATATT 
ATGCAACAGCAATCTTCCATCCAGGCTGAACAAGCTGTAAAAACTCAATTTGTTGGCGATAA 
GGAACCTTTGTCTTCATTAGAAGCTGAATACCATCTGGGAAATTCAATTTTGCTGGAGAAAA 
TAAAGGTGCTGAGTGAACAATATGCTGCCCTTAGAAGAACACGTGGAGATGGAAATTGCTTT 
TTCCGCAGTTTCATGTTTGGTTACCTTGAGCACATTCTGGAATCACAAGAATCAAAGCGAAG 

TTCATCGCA 
SEQIDN024 6 

GCAAACCTGAAAGAAANGANTGGCAATGATATTTNNTCTGATGGCNAAGGTGAANCCAGAGA 
TTACTTTGGTGGCGTGCGCAAACCACCAGGTGGAGAGAGCAGCATTGCACTAGTTTAGATGA 

TG 

GAATTTTAGATTTCAATGGCTCTCAATGAGTTACACGGAATCAAGNTCTAAAGTACCTTTGC 
GGATGCGAGTTTGCTAGAGGCTGGTCTCTAATGTTGG 

SEQIDN0248 ,„„,-,„, 
CAGGGTGCTTTGTGACATATCCCTGCACTGATCACCCAGGTGACCTAACTCTGGTCTAAGCT 

CTGCCTAAAGGGGCATTGTGACAGATCTCTGCACTGATCACTCAGGTGATGTAACTATTGTC 
TAGGCTCTGC 

SEQIDN024 9 

GAAGAGTGGACTCTTTATGAGCAGGTAGCTGTTGCAGCTATGGATTGTCAGTCTCTTGATGT 
GGCAAAGGACTGCATAAAGGTATTGCAAAAGAAGTTTCCAGGGAGCAAAAGGGTTGGTAGGC 
TAGAAGCTATGTTGCTAGAGGCCAGAGGATTGTGGTCAGAGGCAGAAAATGCTTACTCAAGC 
CTTTTGGAGGAAAATCCCTTTGATCAGGTTGTACATAAGAGGAGGGCAGCCATGGCAAAGGC 

GCAAGGCAATACGTCAGCAGCAATTGACTGGC 
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SEQIDNO250 

TTCTGCTGGAAAGTACTATGATGGTAGATTTGATGAACCCCAACAACAATATTTTTTGGATG 
CTTGTTTCCTTTGTAAGAAACGCCTTGCAG 



SEQIDN0251 

AAACTAATATACGAGTTGTGTCTGCATCTTCCTCAACTGGTTTCATAGATTGATCAATTGTG 
GCACCGGCAATGTGGTGGCGAGGTTTACATCCTCGAATTGTTCCACCGGTCAAAAGATGACC 
TGCTTTTCTGAAATGAATTCTGTTTCCCAAGCAG 

SEQIDN0252 

GCACATTTGAGAGCNCNNGCGCANTGNCATNTCTTNAGCAGNGGAAGAGTAANTTCTAGATG 
TAAATACCCTGCTTTCCCGTAAGAACTGGTTTATATTGAAAGCAGAAATGCCTCTGCTGGCC 
AATTTTCGACTTATAATTCCAGACATATCCACTTCCTC 

SEQIDN0253 

AGCAAGTGAAGGATTGGTTTCTGTTCATGGTGACGCTGTGAAGAGAATATGATGAGTCTAAA 
TCAGGAGTGAGGCATTCTCAANTTCATTGCTCAGGGAGCAGAAGTTGATATGTAGATTGCTA 
CTATTTGCAAGAGCACTTTCCTGGCATGTTAGTCANAGTCATGTTTTTTGTTCCTGATCAGC 
AGTCTTCTTTNACTATCTGCCCTTTGNAAGAGTTAGCCATACGTTAGAGCAATGTGTTCTTT 
TCAATGTTGGATATTTATTTGAACTTGATC 

SEQIDN0254 

AGTANGCGTAGGGAAGACAGGGATGGCANTGCTGAGAGGAGGGCAAAGATTGAGCNATGGAA 
TAGGGAAAAAGAAGAGGCAGAATCTGCTAAATACAATAATTTTGACACTGATAATGGCAAGA 
GTGATGGTGGTGATCACTATGGAGAACAGCTNGATGANGATTACCCNANGCAGCANTAGGTA 
NCAAGATGGAAGTTATGGCTNCTGATANTANCGTTACTNTGGNNGGAGTACANGNCCANTGN 
NCTGCAGATTTTGNANANAGCATANCCCTTCTATTGGATTCTTTTTACCANGT 

SEQIDN0255 

AAGCTAACAGAATCGTTTGTGGAATAAGGGTGTCGATTCCGAGAGCTTCCACCCCCGTTATC 
GGNCTCATGAAATGCGACTAAGACTAAGTAATGGAGAATCCGATAAACCTTTGATAGTCCAT 
GTTGGACGACTTGGAGTTGAGAAGAGTTTGGATTTCCTCAAAAGGGTCATGGATAGACTTCC 
AGATGCTCGCATTGCTTTTATTGGAGATGGGCCATACAGGGAGGAATTGGAGAAAATGTTCC 
ATGGCATGCCTGCCGNGTTCACAGGTATGTTACTAGGAGAGGAGCTTTCCCAAGCATATNCC 
ANCGGNGATGTTNTTCTTATNCCTTNANAGTNAGAGACACTGGGGCTCGTCGCTTTGGAGGC 
CATGTCATCAGGGCTTCCTGTAGTANCTGCCCGTGCCGG 

SEQIDN0256 

CGNTTTNTNCTCGGNGNGTCAGCTNNGNGGANGCNCTGGGTGCTGGTTCNNAGGNCTNATGA 
AACGCTCNAAGGCAACAATCTGGTTATGACAACTGCGGGAAAAATTCCCTTTCTGCGCAAAC 
TCTCAAAGCGATGGACTNGCNATCANAGCAANTTTTGCATCTGC 

SEQIDN0257 

ATGANGANGAGGATGAAGAAGATTATAAGCCACCACCTAGGAAGCAATCTGATAATTCTGAT 
GAAGATGCGGAGTCTTTTCCGTTGAAACGAAAGCTATCTCCGAAAGAAGAGCCTGAGCCAAA 
AAGGTTGCAGCGGATTGCTAAAGGCTCAAAGTCTCGAGACGGTGTTTTCGCTGCTTTGTGCT 

CAACC 
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SEQIDN02 58 

ATGATCAAGCCCTAGAATTCGCGAAGATGCTCGATCAATCGGGAACTGTAATTGTTTTGGGA 
AATATCGTATTCCTGAAGCCTGACCAGGTGGTGAAAGCCATGAAAGGCCTAATGCCAATGCC 
CTTGGCCGAACCAAATGACCCAAAAATGATGAAGGAGCTTCAACAAATGGAGGAGAAGAAAG 
CAGCAATTGACAAGAAGGCAGAATCATTGGTGCGGACAGAGTTGTGGCGTGGACTAGGTTAC 
TTTGTGATTCAGACTTCAGCTTTCATGAGGCTCACTTTCTGGGAGTTATCATGGGATGTAAT 
GGAGCCTATTTGCTTCTATGTCACATCCATTTACTGCATGGCTGGGTATGCTTTCTTCCTTA 
GGACCTCCAAAGAACCTTCTTTTGAAGGGTTTTTCCAGAGCCGGTTTAGTGCAAAGCAAAAG 
CGATTGATGAAGCT7CATAAATTTGGATCTTCATAGGGACCAAGAGCTCCACAGAGCTTGNG 
ATCCTCATTCGACGATACCTGGTGGAAACACC 

SEQIDN0259 

CAGATGGTACTGTAAACATGTATGTTCATCATGAGATTATTATTCCTGCGNTTCCTGTCTGC 
ACAGCATGGATCGATTGCCCTA 

SEQIDNO260 

TTGNTTACTCNGCCCTTGNATTTCAATGNGCTAATCCATTANCCCNCACGGAATGACGNTCT 
AAAGTACCTTTGCGGATGCGAGTTTGCTAGAGGCTGGTCTCTAATGATGG 

SEQIDN02 61 

CAATAANTTTATTTGGAGGCTTTCCTTCCCTGCCTGGTTTGATGTCAATGACCTATCTGAAA 
ATGCTATTGATGATGATGAGGGTTTAGATGCTTCAGCAGCATATGTGGCGAGTTTGTTGGCT 
ACGGAGCCCCCTCACATCAAACTTGGGGTTGGAGGCTTCAGCATGGGCGCAGCGACATCTCT 
TTATTCTGCAACTTGTTTCACTCGTGGGAAGTATGAGAATGGCAACTCGTACTCTGCCAATC 
TGAGTGCAGCTGTTGGA 

SEQIDN0262 

GGATCAGGTTNTAGCAGATACACTATAATCANAGTTGNNGTGGTCATGGGGCATGGNTATAT 
TTGGNGGAAGGGGTGGAAGCTTNCCGAA 

SEQIDN02 63 

GTGTCCCAGCAAGGATTACCCAGGTGATGTACCTCTCATCAAGGCTCTGCCTACAGGCACAT 
TGTGATGTATCTCTGCACTGATCACCTAGGTCATGTAACTTTTNTCTAGGCTCTACCTACGA 
TGGCATTGTGACATATCTCTGCACTAATCATCCAAGTGATGTAACTCTTGTCTAGGATGTGC 

CTAAA 
SEQIDN0264 

CCGNTACTCTCCGCTNGACCAGNTCGTTTNCTTCCCCTTTTTCAGGCTGGTGACACACTANT 
ACAGTCAGTANGACAACTTCATCACTGATTTTGAGACAAAGATCAATCTTNTCAAGCTTGCN 
CATTTTGCGGTCATTNNTTCTCNGGAANACCCNGANAAAGAGGCTGNTATAGGTTACCTTGA 
AGGAGAGACTGAGAAACTTCNNNATACTAAGGAGACACNGATAAAGGAGCCGATTCTTTATA 

T 

SEQIDN0265 
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GGCTGTTAGTGGCTCAAAAATTGTTGGCTCAGCCAAAGCAGAATCCATTGAAAGTGGTGAAA 
GGACTCGTCACATGCAGCCTACACTTNCGAATAGTCCACACCCTTCTCTTTCTTGCAATGCT 
GTTGTATATTCTGCATATGAAGCATCCAAGGACGAAGTAACCCAAAATAATGCACCAGCTAC 
TGATGATTGTGGATTCTTCGAGTCAGGCTATATGCTTGCGAACGGGACAGGGCCTCCTATTG 
GAGAAAGCAACTATGACGAAGCTGTTGAATTTGATCCAA 

SEQIDN0266 

ACAAATGGTTACAGATGTTATGGAAAATCTTGTCAAGAGGGCTATAATGGCTGAATCTGAAA 
CTGCTTTAGAGAAGGAGAAGGTAACAATAGGTCNTGAAGAGATTCAAAGAAAGGCGCTTCAG 
ATTGAAAACATGTCAGGTAAGTTAGAAGAGATGGAAAGGTTTGCTTTGGGTACAAATTGTAT 
CTTGAATGAGATGCGCCAGAGAGTTGAAGATTTGGTCGAAGAAACTTCTAGACAGAGGCAGC 
GAGCTNCAGAAAATGAGCAGGAGCTTTCTCGTG 

SEQIDN02 67 

GNNTNTGGANGCTGNACATNTCATCCTCANCNCAGGCCTANNCTTAGNNCNAGGNGCCNNCC 
ATNNTNCAGNTNNCTCTTNCCGNNATTCTANTNATTCGTGCACATGNNGAAACCTATGCTNT 
TGCGNCNGCTNNANGNACANTCANNNCTGCANNGNCNGANCCTTCNTGCNCANCNTAATCAA 
CCTTNCAACNGCATGATGACTCTTCATGCATAGCCATATGNTATCTTCATTACGGGCTTTTT 
CAGACATACCGCTTCGTTAGCAGGCATCTTACCC 

SEQIDN02 68 

GATATTCGTAGGGCGAGGACTGTTATCTTACAAAGGATCATCAAACCCCCAAACCACTAAAG 
TGCTGAAATTTGCCTTAGCAGCAGNGAACATTTATCTGCTTTTCATAGTTTGTGATG 

SEQIDN02 69 

GGGTCAATACTCTGTCTTCACTGCGATCGATATTTCGCGAATGTTGCGGTGAGGGACGAGCA 
TTTCAAGACGAAAAAGCACAGGAAGCGTGTGAAAATAATGATGGGCCCTGCACCACACACCC 
AACTTGATGCTGATTTAGCTGCTGGAATTGGCATGCCAGATAATGGTCCAAAGCTAATGTCG 
ATGAGTTGAGCTTCTTTCGTCCTGTTTATAACTCCTACATTACTGGTAGAGTTCTTTTGAAC 
TTTGAGAATTTGTCTGAGGAACATAGGTTTTTGTTAGTCTACCATCTCTCTCTCAGTATAGC 

AAGT 

SEQIDNO270 

TGATGACCTTTNNGNATCTNGTAATATNTGAGAACAATCCAAACGTTGAGAGCTGCAGCAAT 
TGATCAAGTTACCCTCTTNGAAGAACAGAAGATATTAGCTACAGAACAAGCACAGATGGTGA 
AGAAGCTTGGTGATTCAGAAACGAAGACTGCAATGCTCAAGTCACAGGCTGAAAGTTTAGCA 
AATTACTGTGATGATGTGGCCAGCACTAATAAAACACGAGCGCTGCAGAAGGGAGTCTGCAA 
GTATAGTTCCTATTTTTTGATACAGNTGGTATTGCTGGTTATCGTCTTTGGACTGTATGTTT 
TGCAGATGTCACCTGATGCTGTTGAAGTTGTACCGACATAATTTTGAGAAGTGAGCCTTTTT 
CCTTTTTCTTGTATTTTCAACATAAAGCAACGATGAACG 

SEQIDN0271 

AGGGTTATTCGGGTCGGACCTGGCGAATGCAATTGCTAAAGATACAACAATTTTTGATCGAG 
GTTTAGATACNCATTTGAGACCTACCATTGATTGTCTTAGGAAAACTTTGGGCACCGATGAA 

AATGTAG 
SEQIDN0272 

CACTCAAANTCCNGNCAGAATCCGGNGAANTTTTCGGCGAGACATTCCAGTAGAGTTCTTGT 
CCGAGGTTTTGACATTTCAGATTCATCGAGGTCTATTTCTTCCTTCCTCACGTTGTTTGTGC 
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SEQIDN0273 

ATTGGCCGGTCTTGGACTTCAAAAAACNNTCNCAGAGCTTCGAAGTATCACTTCTAAACCTC 
AATCGGAGAAGAAAAAATATAACANAGTTGACTATTTCTCTACTCCTTTGCGCCGTTCCGAT 
CGATTGAAAGGCAACACCCCTCCCGAATCAGAATTGCGCCGTTCGGGTCGCTTGAATGAGAA 
GTCCTGCTACTCTGCTCCACCAGCAAAAAGGAAATTGGGGCTTTTTGAAGAAGGAGATGTTG 
AAGAAGATAATGAGAAGAGACCTGCTAATGCACCTCTCCTGAGAGTGAAAGATGGC 

SEQIDN0274 

GCCNGCTGTGNNCTGCAGTTGTTGTAAAGGTTGAAGTAGCTCTAGACAAAAGCATTTGCATG 
TTGACCAGATGAGCAGAACTGATGTTATTTGCAGTAGAAGGAGGAGGTTTCTTCTCGTCTTC 
AGCTTCTGGATATAGTAAGGGCCTGACCCTTCTACTCTTGGGTCAGAAGAACGAAGAGAAGC 
CCATGAGAGTTGCACCGTGGAANCAGTACCAGTTGGTGGACCAAGAAACTGATCCGGACCTC 
CAGCTGGCTTCCGGGAAGAACAGGGTTGTCCGCGGGTGCGCCTCCTTTGTATGCTTTGGTCG 
CGCTGCCGCTGGACTTGAGAGCCCATCTCCCC 

SEQIDN0275 

CGTTCTNCTGGATNGTTCCTGGCTATATTATGGGAGGGGAAAACAGGAACAAAGAGAAAGCA 
AGATTGCGAAAAGGTATATCTATTCTTGTTGCAACTCCTGGACGTCTTTTGGATCACCTAAA 
AAACACATCATCATTCTTGTACACGAACCTGCNCTGGATAATTTTTGATGAAGCAGACAGAA 
T T C T G G AAC T T GG AT AT G G T AAAG AG AT T G AAG AN AT AC 

SEQIDN027 6 

TCCAGGATGATGGCACTCCTGTCTCAATATTTGCACTTACGGGGAGTAATGCAAACGATGGA 
CATTTAGCTGCTGGCCGAAATGGAGTCAAGCGACTTCGCACTGTTAGGCATCCAAATATTTT 
GTCATTTCTTCACAGCACCGAAGCAGAAAATTTTGATGGTTCTACTACCAAGGTTACCATCT 
ATATTGTTACTGAACCTGTCATGCCACTCTCGGAGAAGCTAAAGGAATTAGGA 

SEQIDN0277 

ATGNGCAAATTTGCGATCCNAGCGTCAGATGAATCCATTACCCAGGAGATTGCTTCANATTT 
TCAGGGNTGGNTGNATGATCTAACTGATGGTGGTGTTGAGTACATGCCTGAAGANNAAGTAA 
AGGNGGCTGCTGCTGAAAAGCTAAAGATTTCAATGGAACGGATAGCATTACTAAAGGCGGCA 
AGACCTCCCCGAAGTCTCCAAAATCTGATGATGAAGAAGAAGAGGAGGAAGACGAGGATGAT 
GAG AAC C AAAAG AAAG AAG AC AT G A 

SEQIDN0278 

TTGTCTAAGATAA/y^AATGTAATAGTAAAGAGAGCTGCAGATGAAGACATGGAAACTGCTTC 
TATGTTGCTTAGGTGTTGCTATAATTTTTATAAGGACACTTTTTGTGCATTGCTCCCATCAG 
GTNTAAACCTTTATATGGTGCCATCTCAATTTGCTACAGAAACATATATCCAACCTGGGATA 
GATGCAGTTGACATACTCGATATGAACACTTCACGGAAGCTACTTTTGTGGGCCTACACACT 
TCTGCATGGCCATTGCACAAATGTCTCAGCTGCTA 

SEQIDN0279 

GCTTTCTTGCCTGCCGTAGACACAGTGNGAAGGGNGAGTGCCTACATGAATGNTTTAGAGTG 
AACCCTGATGGTGTCAAAGACAAAATTAGCTGTGGTGAGNTTCTGGATNTGACTCTNGAGGA 
TGNCGATAAATGCATAGAGCTTATTTNTACGCCGATCCGCAAAGATGCA 

SEQIDNO280 

GCGATACGAGGCGAAAAAACTAAGCTTCCGGAGAGTGTGAAAGCAGATNCCCTTACTAATGA 
AGCTTTTCTTGACCGGGGGTTTACTCGCCCCAAGGTTCTGATCATTCTCCCTCTAGCAAGTG 
TTGCATTTCGAGTAGTCAAGCGGCTGATTGATTTGACACCTCCTAAATACAAGTCTAATGTA 

FIGURE 4 (continued) 



BNSDOCID: <WO 030851 15A2J_> 



WO 03/085115 



93/140 



PCTYEP03/03703 



GAGGAGCGTGAACGTTTCTATAGAGAATTCGGGGCCGGAGTAAGCAAAGATAGGGAGGATGA 
AGATGCCGTCGAAAGCTCTGAATCAAAGAAGAGCTCAAAACCATCTGATTTTCAAGCATTAT 
TTGGGGGAAATAACAATGATCACTTCATGCTAGGAA 

SEQIDN02 81 

GCGGCATGTGAAAATCAACTGNTTGTGATATCCCACCTACTGGAC 
SEQIDN0282 

GNGTACGGGGNCCGGGCATAGATATGCCTGNANGGAGTNN'GACAAAGCTTGCAGAGTGGNTC 
ATCCTTGTCAGACCACCCCTGCATGTATATNTTCTNTTGNTTNCCTNTCCCAGTACAAAGAT 
GGACCTTACTCCAGACAGCGTATGGTGGTAACGGATAGCTAATTNAGTGCANAGGTGTTGNC 
CTCCTCTTACTTATACCTTTCAGCAGTCCCCCATTATCGTGG 

SEQIDN02 8 3 

GCTNACTNACATAATAATNANNCCNGAAAANTAAAACTTCTTTTNAATTATAATCATAAGCT 
CTACTCGGAGATGTG.^CAGCGAGTTTTAGGTGGACTTNTGT^AAGAATGCCTCGATTCGTNG 
TGNTCCAGAAGGAAGCGGCTTCTCTGTTGATAATCGAGGACGATTTTGAACCTTAGGAGAAG 
GATCANACGGCTGTGAAGGCACGGGAAGCGAGTCGAGAAGGAAATCGTTCGTAGGTTGATGC 
CTTTTCACAGCAACTC 

Group ,4 
SEQIDN0284 

GGCCATCGGAGCAAAAGAGAGCAACTTACATTCTTGAACTACGTGAAGAATCTTCAGAACCC 
T 

SEQIDN0285 

GAAGGTGAAAATGGATATNGCGATATCTCAAAGGCACNTNAGGTATGGCACTTTGTT 
SEQIDN028 6 

CTGNTCGAATGGGATATGCATATTCATATGTCCTATTGTACTAATCAGAGTTTCAAGATTCT 
GGCTT 

SEQIDN02 87 

TGGATAGGTNAGCNANGAGCANACGANANNCCTGACNGGGAAAGGGATGCANTCAGACTCTC 
ACTGGCTTCAGCAATTCTT 

SEQIDN02 8 

TTGAAATANCNNNTGNNAANNCTNACATTAGCCNCTCTGTTGTGAGGAAAGGCCTATTCCCC 
CTCTCTATGTACTTCATTTCTGNCATACAT 

SEQIDN028 9 

TGATNNATGCTCTNTAATTGCCATACTCATTGGTAATTGTGTTGATGNGCCTTNATAACGGG 
TTATNATGGCCTNCTCTCTTCTATTAGCGCCAAATGTAGGAAAGTCATTAGTTTGTGTTTAG 
TTCAGGAACAGACATATTTCAGCCGTGCCACCGGACATCGCATGATGTCAAACTCTGNGAAC 
TAATCTCACTAGAGACGAGAAGACNATGGCCCGCTAGT 

SEQIDNO290 

GCAGCAGAAGANATGAACCGAAATGAAGGCCTGAGTTCGGCCCCAAACAGCCGATTCAACAA 
C AG AAAT C AA T G C AC AG AT T C AAT CT C GAG C AG AAT G T 
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SEQIDN0291 

GGCATGANAGGAACATTCACNCGTATGAGCACGCATGTTGCAGANTCTCCTTCGNGGGGCTG 
NTCCAAANATTCACCACTATGTTAGCCCAGGAAATTCNCCTCCCCNTGATNCTTCTGCTCTG 
CAGT 

SEQIDN02 92 

NTCTGTGCCGGCTCNANTNNGGATACTACAGCCGAAACCCTANCGAGCGTATNNNNNAAGTG 
CGCAAGAGATTGACAGATTGTAANGCTGTTACNGAGAATGCTGNGTAGGGAAGTCCATAANG 
ACCGCGTGATTACTATGT 

SEQIDN0293 

NGGAGTAGTAATACCCGTGTGGATAGTACCAAACTCAATTACTTTAGGAGGGTATGTTGCTC 
AACCTACCAACTGGC 

SEQIDN0294 

CTTTNGGNAGTCCGAACNCCCTCNNNGANAGACCAAANNGATGCGNNNNGCTCNTGCAAAGG 
GTGAGGANCNNNATNNTNGCC 

SEQIDN0295 

TTGCAGAATTGATGTGGTTGCTTTGCTCTAAAAGTTGGAACT 
SEQIDN029 

TTTGAAGNCCTTTNANCNNCNCTNANAGGGGCTGNNGNTGGACGCANCACACGATTCACATT 
CTNCNCCTTAGNCGAACGTGGTGTTCGGAACAGTTTACATCACT 

SEQIDN0297 

ANCCCANGGTTANATGGNGAATCACACGATNACANANCTTCTCCTNAGCCGACGCCTGTACG 
GAACAGCATACTCACT 

SEQIDN0298 

GNTTAGNNANCCNNCGGTNNGNGATNGGATGNNGNTNAGGGNCTGNTTCAATCCTGTATAGN 
GACTCTTTNTTACCCGTTGTGTTCCNCT 

SEQIDN0299 

CCTAGANAGCGNGCTCCNGAAGAGAATAAGGCAATNGCCAAAGTTGCAAAAGTTCATGCCCC 
TNCGTTAGCAGNTTGGATCAATTGGCACAGGAGGGCCTCAGCTNTGCCTCGAAGATCTAAAG 
CTTTAC 

SEQIDNO300 

GGCCCTGACGTCTCCTCTATATTTTATTTCCTATTTCATCTTTTTTGCTTCAGAAACAATGT 
NTCCTTTTATTCTCGGACCTTGTATTTAGCAGTCTTAGAACGTCGGTGACATTGTGACACTA 
GGTTTTGGGTGATTATGGC 

SEQIDNO301 

NTTNTAACATCACGCATGCATAAC^lAACTGTCAATTGGTGTGAATATTCAGAAGTCTCTTAT 
TCATATCAATNCTCAGGGGGAATATNACNACTCTCCAGGAAAAAGACGTTTCANANACGGAC 
AGCTGCNAAGAGATGCAGTATGACAAGAAATTCATTCCTCTTCCTCCGCCTCCTCCAGCCAT 

TTCACAAAGGGCTCCAGCGACTTGACAAAGTTTTGCCTGCCC 
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SEQIDNO302ATAAACTCTCAAGTCCTGGCGGCCAGAAAACCAGCTTTCAGGCGGGGCGGNG 
GGGCTCCCCCTCCCCTTGCTTCGTCTCTGC 

GTAGGAGTCGNGGATGAGGANAGAAGNGTCCTGAGNAATNGAGGGAGANGGTGGANGAT 
SEQIDNO304 

NCCCANTGNTTTGACNCNGTGGTGNGAGGGGTNTTAANATGATTNAGTGCTATTNGCTAGAG 
TGGNTATAAGNCTTGGA 

CNCGATNGTAAACGCCCCGCANCGGNTATGGNTAAAAAGNAGACCCTCAACAAAATNANGGA 
ATTGANACNTANCNAAA 

ACNANTATNNGAAGGTAGAGNGTNTGATGGGNGAAAAACGAATNGGGACNGGGGGTGCNTAA 
ACNNNAGTCAGNTNGAAGAANATAGA 

GNTNNATNAGCACTCTGTTGTGAGGTAAGGGNCTGGTGCCCCTCGGGATGTANTTCANTATN 
GCCGGAGAT 

SEQIDNO308 
NTTTNGGGTG 

AGAACAAACTGNTCTGATTTTGTTCAACTTTTTCTTCTAT 



NTTTNGGGTGACAAGTCTTATGTCTCAGGAATAGCGCCATTCATNGGTCGCAAAAACCTTGA 



CACCTNTCAACAGCATCCAGCNACTCTAANCGCNAGAAAAACANCCGNGCCTNCATTGAAAC 
CNCCATTTTGCTTTTGNTGNTCGAAGCNCTNNTCNNCAGATCNCGATNCTGAAAN 

CCA^AGTNTCCGGCTCCANAGGGTTAGCAAGNGGGANGATGGCGTNGGGNNAGCGAGAATGA 
AAGCCTTCATNATCCCANGNAGAGAACA 

CCCATTTTCANCNACCNAANGCAGCCTAGGTTANAACCTCTNNNNNCTGNACAAGCANCAGG 
CTTTAAAGNTGNATGANTGAGGTCGANNGCGANCNTCTCAGNTNTNCCAGTATCCTCGCGCC 

TGAACCTA 

ANCCTGCNTGTTGTAACCGCCTGGGNTACTAATTGTATNANCTCTGCTATAAATTTTTTTAT 

nnSnctnnnnntggggantagaaccattttgttcanttcactttagnntttgtnatgnaatg 
aaataatagctatatccntnnnntgaannaaatgatggctgntgctgngggg 

CTGTTTTGGGNGNCAAGGATNNNGNCTGAGGNNNAGCGCCNNTCNTTGTTNCGCNANNAGNT 
TGCAGAACAAACTGNTGCTGATTATGCANAACNTTGCCTNCTG 
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SEQIDN0315 

CCNGGANGNAGACCCNCTGNTGGCATCAGGNTATACTAGCNTCAACTAGGGAGTGGAGACCC 
TATNTTGACA 

SEQIDN0316 

NCNTNAGATGNNTAAAATGGTGNGNTGCTTNGGCTCTAANGAAGNNGGGGNACT 
SEQIDN0317 

NTCGTNNNNNNNCTGTGTACTGNNATATGTGTCTGNATTACTCCTGNTGTAATGCATTGACT 
TATACGGGNCTTGGG 

SEQIDN0318 

TNGNTANGCCCCTATTNGTTACAGGATCNCTACTTTCCCACANAANATCGNCCATNGC 
SEQIDN0319 

TTNTAANACCCCATNNTGCATCTCACATAATGGACCGGCCANCAATANGTGAATTAGCTGGA 
TGATATTCAAACGAAAATTCATCATCTCC 

SEQIDNO320 

TTGNAAGCCCTAGTTNTANCCCAGCAGGGGCTGCTCCTGAAGGGCAATTTTACCCACCTTAT 
TATCCACCCTATGGGTATACGCCACCACCTCTACCATATCAACANTATTATCCTCAACCTTA 
TCAAGCTACAACCCCACTCCACCTGGTGGTCAGCAAGCCACACATCAGCAGCAGCGGCACAA 
CAACC 

SEQIDN0321 

TTTGTAGCCCTAGTTGNTCCCAGGNGGGGCTGCTCGTGAAGGGNAANTNTACCCACCTTATT 
ATCCACCCTATGGGNNTNCGCCACCACCTNTACCATATCAACAGTATNATCCTCAACCTTAT 
CAAGCTACAANCCCACTCCACCTGGTGGNCAGNAAGCCACACATCAGCAGCAGCGGCACAAC 
AACC 

SEQIDN0322 

TGCCCAGGGAATGGGTATTGGGNGCAGTTGTACCGGGAACACTANATGACTATCAAAAATGN 
GCTTCACNGGACA 

SEQIDN0323 

GCTNNGGAATNGNANTGGAGCANNTGNACNNGGACACTACATGACTATCAAAAATGGGCGTC 
NTCACGACA 

SEQIDN0324 

TGAAACTATGTGCAAGAATTAGTCAGTTGACAATAATTTGATTGAGTCTTTCAATTCTTAGC 
ATTTTGGAAGCTAGATACAAGCCTATGA 

SEQIDN032 5 

ATCNTGATNNTCGGCCATCTGGTACNTGGAANNGGCGCTGGTGAGACTTGANTCTNGNCAGA 
GGNGGACCCCNAGCCACGAGCAGGATGCTGCATTANCATTGCNATCAGCAGTATAGGAATTC 
TCTTGCTCTGGCCAGATCGAATTTGAGGGCCATGGCATCAAGAGCCA 

SEQIDN0326 

GTTAATACCCGGATGTGGAACAGGAACTTCAGTCTGTNNGATAAGAATTACCTCTCCAGCAT 
CCAGGCTCAGCAGACTCTGCATCAGATTTCTTCACAATTCAATGGTGCT 
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SEQIDN0327 

TTTTTNTAGCTNCTAANAGCCCAAATTTCTCCGAGNCCAAAACAAGGTCAAGGTCCAAACAG 
TGAATTGGCCTTGGAGCAGGGCGTGAAAGACTCTGATATAGATGCTGCAAAAGTTGCTGCAT 

TGAAGGCTGCTGAACTAG 
SEQIDN0328 

TNNGCNGATNNTAAANTCCCCTCTTCGACGACNACNGCTNAGCATGCNTNTGTCTGANGAGT 
NCTAAAGGCTGTTNCCAAATTTACTAGNTCTTGACATGCGTATCTAACTGGANTGATTGGTA 
G AN TATAAAANT GN GACAANNNGTN TGACTNG 

SEQIDN0329 

TTTAAAACCCTTNTAAAAACCGAAAAATGCTTNTAAAAGGGTCCAAGGCAGAGACCAAGAAA 
ANTAACTGTTGAAGANCGGANAGATGGAAGNAANGTANAATTTTGTNNAAGGATATGGTNAN 
GATTGTTTTTNAAGAGANGNCGNAAAACNAACCCCAAAATTCCTCCAG 

SEQIDNO330 

TTTTAGCGGCTNCTAAAGCNCGGACTAAGAGACCNTCNGCAAATGGCNAGGNTTGCNAGGTA 
ANNGCNTGNCNNNCGCNANTCNNAGTGCNCCCTTCNATNTTAGTACTNTTNCGNATTNTTAG 
ACTATNNANGGNGANAGTAGTACNGACCGGAANANGAGGCTCGAGACTTGTGACACCAGANC 
ANANTGNGCTACNCCCCCGCTAGGTATTGTACNCTTCCNNATGAACNTNNCGNTGC 

SEQIDN0331 

TNCNNNNNNNCTNCCGAGCNGNTNTCTCTGACTTAGGTNTATATTCTAGGAACTCTTCAGTG 
GGAAATGCCGTTNAAATTATGATACTAACTGTTAAGGTAGGAAAGATTACTGGTTGACACAG 

CATA 

SEQIDN0332 

GNNNNTGTNNNNGGNGTGNNCGCATNGGGTGAGTGGAGTTCACNAGNNTGGGNNACTGAAAT 
TTATAGAGACGCTANTGAGGGGGCGGAGNGGCCNNNNTCNNATTCNGACNTTCTNGTGCCNN 

ATNACNATTAGAAGNA 
SEQIDN0333 

CCCNNNGNNNNCGNGGAATTGCGANTTGNAAAGCAACNTGTTGTCATGNAGAGCAGGAAACA 
AAATNTCGTATCTCGATCTAGANCNTNAGCACANTACAGANNTATGNNACAGGCTGTGNGNG 

AGGTANTCANNTATCGGTTTGTA 
SEQIDN0334 

GNNNNNCNNNNGCNGNTCTGTGGTCTTGNCNTTGGANATTAAGCNCCTACTTNNTACGNTAC 
TGNANNAGNCNGCNTCTANGAGCAAGCNACNAGCCCTACTACTANATTNANCTACTGCCTTT 
ATGTNTAACAAAGNNNGAGCAAGANAGGACCAACAGATGCTACTAGCTAGAGTTGATCATA 

SEQIDN0335 . „ 

TAAANGNNNNGNAGCAAGGAAGCTCTAGCTTGAAGGATGCTGATTATNANTTTTGATTAGAA 

TTTTACAAATGTAAAGAATTATACTAATGTAAAGAACTACGTTTGGGCTTGATCCCCATAGG 

AGCTTAGCCCGGGGTACGTAGGCAACCTGTGAGAAAAGGAGAGATCAGGTGCAGCCCCTTGT 

A 

SEQIDNQ336 
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ACNCGAATNGNAAAGGAACCCGAAACTATGANTNNNAT^ACTNGNAATTCTTTGATGCTACAA 
ATTGGCACTGNATNG 

SEQIDN0337 

GNGTCCAATTNNGGTTTACGTGTTACTNTNGTTTTCCCTGCTCATACTAAGCTGTGAAGATG 
ATTTAGTGCTATTTGAGTAGCAGTGGTTGTAAGCCTTGGGA 

SEQIDN0338 

GTTGTGTNCCATTCNGCATGCTNTATTACATGNGTTGTATGAGGTGNNACTGATCAGGAACA 
C TAN AT G AC T AT C AAAAAT G T G C T T T AC GN C A 

SEQIDN0339 

CATTGTCTTCTTTTTNTTTCTTCTTTTGGCGAATTTTCTTTTGNTTTCTTGA 
SEQIDNO34 0 

CTGGGTGT7\AGTNGAAGAAGGATAATGGACAAGTGATCCAAAGCATTATAGGGACGACACTT 
TAGGCA 

SEQIDN0341 

NGGCCGGAGTGGGTGNGGNGANGANTGGATCGTTGGTGAGTGGTGNGTNNNC 
SEQIDN0342 

GCCGCATACATGCATATCCGNGGGGCAGCAGGATGCGGGAACAGTTTTTTNATGGGNACCCC 
TANTGCANGNNCN 

SEQIDN0343 

AATCCNNGTNTAAGATTNTCAGCNTTGGGCNAGAGNAAGCNCTAATCNTGATNANCANTGGT 
GAACCNAANTANCCAGTTACCACCT 

SEQIDN0344 

GCTCTTCTGTAAANGGTTATTTTTTGACTGACANNCAAGGGGG.TAAATTTTTANTTANNACC 
ANAANTTGNTTAAGGNNN 

SEQIDN0345 

ACGTACACATTCTCCTCAATTGCTCAGGAAATGGTATTGGGTGCAGTTGTACNGGNAACACT 
ACATGACTATCAAAAATGTGCTTCACGACA 

SEQIDN034 6 

GGTGCGATCGNCTGCCGAAGAAGCGTTGTACTTGNAAAATATCGGAGGAAATATCCCTGAAA 
TAACTGCCAACGCTGGTGCAGNCAAAAGGTACTATGTTCGNTCTTNNATNTAGCA 

SEQIDN034 7 

NCNGTTATAGTCGANACACANGGNATGCCCTCTNGNAAACATNTATTGTACNGGATGACGTA 
TTCTGATANTNNCTTCAAANAAAGANNCATCACTAGNGAGCACGAAAGATAAGTGTNTTNTC 
T C AAAG AAAT G AC C A 

SEQIDN034 8 

TNGCCGNTTNCCATGNNGNACNTGGATANTCNAANNCTNTCCGNNNGNGCTCGNGNNTANNG 
NCCGGCNANACACCANNCCNACTNTNTGTGACGCNTGNAGGACNANCTATGNTGGNAGGANT 
TNATAGNNNGNNCCANATCNGCNCTNGACAGNCACTNNCCTGNGACTNCNNTGNANC 
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SEQIDN034 9 

TAGNCCGCTNGTTCAAGAGATTNNGCTCTGGCATCTGTAAGTGAGATATCAAAGCGCACTTC 
TGAAACCCCTCAACGAGAAAATAGAAGGAATNCAACAAAGATTGACCAACCATTCTGTAGAA 
GCAGAACAGAAAAGGGTGAATTGCTATCACACTCAGGAAATTTTGAATCAATAAACGAGAAT 
GGAAACAGAACATGTTCCCGTACTGNATTTTNTCCTTTCAGC 

SEQIDNO350 

CCCAAANTCCNTCTTNTACGATTACTCAGGAACNNATNATGNGATTGNNCTNGACCGAANGC 
CTTNTNCGTGATTACCTGGAAAAGCTGCAGCTGGACA 

SEQIDN0351 

TACTATGTTATTGTTCGTCANGANANTNTGCAACNGNTGNCCCA 
SEQIDN0352 

GCGAGGGCCTCCCAAGNTGAGTNTGNAGCNNGGNGTNANGNAATNAAGAGNAGAAAGAGGNT 
CANGCGGNNGAAAATGTAA 

SEQIDN0353 

GCGCGGGACCCTACCGAANGGGTAATTTGNAGCAGNCTCGTACAAAANATAGGAGGAGTANA 
ANGTAAGNTCNNGCGGAAGANNATGTAA 

SEQIDN0354 

TGTGNTTCTTCCTGTTATGGGGACTTGTTGGTTATTTCCTTTTTTGTGAAGCTCTGGTCGTT 
ACCTCAAAGTGTATGTACTTCCAAACGGAA 

SEQIDN0355 

ATTGTCTTCTCTTTTGGTTTCTTTTCTTTTGGCGAATTTTCTTTTGTTTTCTGCTTGA 



SEQIDN0356 

AAGCACTCTGTTGTGAGGTAAGGCCTAGTCCCCCTCTCTATGTACTTNATTTNTGCCATACA 
TTT 

SEQIDN0357 

GGATTCAANCCATCGAGGGTCCATNGTGGTCTCCGGCTTACGGNCTATTNGTGNTCAACTAT 
TNGGTGGNCCGCATNNTTCTTGTANACTANCGGGAANATCT 

SEQIDN0358 

TGCTCAGNTNGATCNAAGGGGNGTNTTTTNACATGGAACAGGGCAACTGCCTCTACTTGNTT 
TNATGCCTTTTTCATTNNGTNCATTTCTAGGGATCGGCCGT 

SEQIDN0359 

GATNNNNTANCCNNGGNCTATNAACGTTNCCGANGCAGGTNCGCNATGCTNTGNCCTTATNN 
CATNGCGAANGAGTACCNGGANANCCCNCNTGGACANACNTGAGGGCAGCCATGGGNAGGCT 
GANACAAAATTCTGGTTCACTAATTTCCATCTTTNCTTTTTNTTTATNNGCCAACACANTAA 

CTNTATTGGTACTAGAACATGGNATTACCTTTGGGT 
SEQIDNO360 

NACGCAGNNNAAANACGATGACGAAAGNCCGCCAAAACCACTGACTTGACACNTNNAAGATT 
GCTNGGGANCANAGGANGCN 
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SEQIDN0361 

TGTNNAAANAAGGCGTGCCGAGGCNGACGGATGTGNCANGTGTCNCANGACGATGTTACTGA 
ATNGGTANTTACANCGGGAATCTGTGGCGNTCATGC 

SEQIDN0362 

AAGNCGGAANGTTTGTANCCCGNACCNCANANAAATTCACATTG 
SEQIDN0363 

GGAGCACAGCAATTCNAAATTCTTTCTACCATTTTGGTTTCATATCTAAGTCATTCCCTATT 
GGGCTTGCGCT 

SEQIDN0364 

GTGGGATGCTGACNNTGNAGCTNTTNGTNTNGTNCCNNAGNNATTNCNNGCNATTAAGCAT 
SEQIDN0365 

CGCTANNNTAGCANTCCGATGTGAGGGANGNNNCNAGNCCCCCTCTTTATGAACTTGANTGC 
TGGCATACA 

SEQIDN0366 

GNAAAGCTAANGTGANNATTAGCACTCTGNTGGGAGGGTANNNNCTANANTCCCCTAANTAT 
GNTACTTAATTGGGGCCGTNCAT 

SEQIDN0367 

GCCGGCTNNTGNAGNGNCGNTGCTTNNTTNAGTNTNNTGAGCATGGNCCTNNAGAAAACGCT 
NGTGGCATGATGCNTNANGGGGN 

SEQIDN0368 

NNNTATCCCTGCTGTGAGGAGTGTTNTTCCTTGTGTNATGCCTNTATTTGNGTTTCCGCNNT 
TGTGCTCTTNTCNTAATGTATAGATTNTNACTGTAGATTCTCAT 

SEQIDN03 69 

GNCNGGTNGNNGAACTAAAGTAAGTNGGTAGGCATGGTGGCGAATGAACCTAAAAAGTAAAA 
TCTAACTTGCAGGATCAAACATANGNTCA 

SEQIDNO370 

CNCATTGTANATCAACCTATATGATGGACTTACGNGAAGTTTCCAAGACACATGACTAAAGC 
TGACCAAGTCTANTAGGCTAGNTCAAGCCCGTACCGTGACA 

SEQIDN0371 

TCGTATTTATGCNCATGAATGATGTGCAGTGNTGTGTCCTGACTNATNGGAGCCGTTGTCAA 
ACATGNNGTATGAGTAGGAAGNATTNNCTGCTCNTCTCGGNCATGNAGGNAGCCANATNNGT 

CNGNNAGTGCAGAT 
SEQIDN0372 

NCGATNCNNANGACNCANNNNNGCGAGGTGNGTAANANTTTGNNACCTTTANTNGCTGCACT 
ANGANATCGACNNGCNCNGTGANNGNNNNACNTGAGGAAANCANAGCNGGAATGNCTNAGTA 

SEQIDN0373 
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AATATGGAACTGGAATTATGTATCTGTATTACTCCTGTTGTAATGCATTGACTTATACGGCC 
TTGG 

SEQIDN037 4 

GCTTATAGTGCTGNATTTATGCTGATAAATTCTGTAACATAATAGTGAGGTTGTAATGTAGA 
TGTTGAAGAGCTACCTG 



SEQIDN0375 

GNTNAAGCAGNGTNGNTAANAGGNNGCATTTTCTAGTTTCAGATTTTTCTGTTCTTGGAGCA 
ATAACATCCATCTTTCTCCT 

SEQIDN0376 „ M . m 
CCGTTNCCCTCAAACACCCTTGAATCCTATCGAATCTGGATTTGAAGACGAACCCTAGAAAT 

TCCAAAATCCTAAATCGAGTGTTCGTTGAATTTTTCCAGTCTAAATTGATTTTATTCGTGTG 
TTCTTG 

SEQIDN0377 

TTTGNGATANNTTTAGTTGGATGGNATGGAATGCTTATCTNNTATNCGAAANGATGGT 
SEQIDN037 8 

TTGAATAACNCCAGNATNGGCNNAATACANNCCCTAATANCGAATGATCTGGTATTTTACAG 
GNCTGACGGGGGGNCGCCCTTTTCCGTGN 

SEQIDN037 9 

TGCTTGTANNANGCCNATGCTGTNTGGTGGNNCGCGCACGTNGTGNTCNNNTGAGAGGACAT 
NTCTGANTTGNGCCAGGNNCCNGANGAAGACTNCCGATANTTANTGCCGAGGCNCATGGGGG 

SEQIDNO38 0 

TNTAGACCCGTTTTATACAAAGCCCAAGGACTGAGACTNTGTACAGTTGCGGAATCTGCTTG 
ACCCCTTTTACATGGTTGATACTTGTAACCAAACAGAACATGCTGAAGGTGCAAAAGGTGGA 



SEQIDN0381 

NGCACGGCCCTCGGNCTTGCAAAAANGTGGNNACACCCTCGGGGNCNNNGCCAGNGGG 
SEQIDN0382 

ACTCNANNCCCGCGTGCTCGCGCCAGCTCCCAATGCAAATGGNATAGAAAAATNCAATGCTG 
AGCATCG 

SEQIDN038 3 

NNCTNNAATGTAGCTAGTACAAGTGGNAGTGNGCTACACAATATAGCTTGACCCCGACAAAA 
ATNCTNCACGCACTAGNAACTCATGACATGGTATACG 

SEQIDN0384 

CGNAGCNCGNNCGNACACNNCGACAAAGGGANCGNCACANCC 
SEQIDN0385 
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GCCCCCTGTNGCTGCTCCCTNAGTGNTNGGNCATNCAGTGGTAAGCATATTGGCCTGCGCCA 
GCATACTCTAANCATGGTNTGNGATAGAATTCCATCACGCTACTCTNGNGGCNCATGAAGAG 
CATATCCG 

SEQIDN0386 

NTGTAGCTTTCTNTGTAAGCTTATGTACCTANNNGNNCCTGCACCGCCCATGGCTGCCGGAT 
CTGATAGCTCCCAAACNATTNGTTTCAACCACAACCCT^ATTCTTGCCCCAAAACCAACCACA 
TCGTAGCCCACCAGNTNTGTTCTTCTCTCCG 

SEQIDN0387 

NATNATCTCCGTGAGAAAAGACNCTAATGANTATNGNTTAANCTTATGCCCTATACTCATTC 
GACGACTNACACTGNAATAAAGCCGAGTAATNGCAAATGCATTTATTTATACTACACC 

SEQIDN0388 

TTGANTACNNTNNANTNCNGNCCTTCCNTNCAAACAACAGNACNNTGAGAAGCCATAAAAAT 
ACAGCTAG 

SEQIDN0389 

GTGTNTCCTTGTGTNATGCCTNTNTTNTTGTTTCCGCTATTGTACTCTCATCATAATGNNTA 
CCATTTTTCTGNAGATTCTNA 

SEQIDNO390 

CCTCACGTGGTCTGGGACAGGGNACCNCGCTGGGCTGGGGCATNTNANGGCTCATATCGTGG 
CAG'AGGACATGGCACTACACGAGGTGGTCGCGGTCGTGGAAGTAGCAGTTTGGGGCCGTGTC 
AG 

SEQIDN0391 

GGCCTGGTCNGTGTACTTANACAAAGTCCCAAGGACTGAGACTNTGTACAGTTGCGGAATCT 
GCTTGACCCCTTTTACATGGTTGATACTTGTAACCAAACAGAACATGCTGAAGGTGCAAAAG 
GTGGAG 

SEQIDN0392 

CTTAGCANCACAGCTGCTTANCACAAAGATACCAGCCCAGGGAAGTTGAATTTGNNTGTCTA 
CAGCNAAAGCCATTGCNGANGNAAAGCCCCTNGTTN 

SEQIDN0393 

TACAAAACGTNTTCATTCTTTCNANTAAATCTTNTATTNTTATNAGAGACATGGGTNGCCCG 
TTNGANGGAGTACTGNTGTTCTTCCTCNNGNTNAGTTGCNGAATATTGCANTNGCTGC 

SEQIDN0394 

GCTCTACAGAGGACAAGNACTNATATCTGNAGACAAGAGGGAATTGCAGCACTCANGATGTG 
GTAGAACGGACAAGGGAGTTTCCTCTNNTGNTCAAGTGATNTCTCTCTTC 

SEQIDN0395 

CCTTNGNTAGGCCGNCGACCTTCAGGANAACCTCNNTNCNGGAGACCGTNNCTNTCGNCNTG 
NTGATGGCCATNNNTTNAAACGNNTTGTGATG 

SEQIDN0396 

TGTAGTGAGGAGANTGAGGCTGCAGATGAGGTGGCTGGTAA7\ATCTGTGATGAATTTGATTC 
AACGGTAGTGAATAGTCATGTCAAAAGACTACCACTTGCTGATGTAACTGATTCATATCTGA 
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ATCTTCCTGCTTCAATCTCTGCAGCTGAGAGGTCTCATGCTAGGGGAAGTCTGGATTCTGTC 
AAGACAGATGCTAGCTGCACTGGGCATCATAATAAAGCCAAAAGAAAGCTTGGAAGTAGC 

SEQIDN0397 

GCAATCTNAACTCCCGACTTNGNTGNGTNCTGATNTCTGCTGTTGAATCGGCTGTTTGGTGG 
CTGAACTCAGACCATCATTTTGGTCCATTTCTTTGATGTTGTTCTGCTTGTAGTTGTCCTGA 
AGTATTTATGGAAGTTGATTCAAGTCTAATAGTGGCCTTTACTCTGCATTTTAGCTGTCCTG 
AAGAATTTATGGAAGTTGATTCAAGTCAAATAGTGGCCTTTACTCTGCATTTTAGGTACGTA 
CAGGTCAACTGTAATTCTCTGTTGCATTTCTAAATGAAAATATGGGTTATCTTGTCATGTTT 

NGNG 

SEQIDN0398 

TTTTANGCAAGNNTNNCCTCCCANGAACAANCCCTTAGTCCAGNTTCAAAG 
SEQIDN0399 

TTTTANGAGNAACTAAATCCCCTTNTNCCGANCCCNTGCAAAANGNGGNCTANACNGNNNNN 
NTGAGNGNNNAATNCNAANATNAAACNCTGCNTTCATTCTTTNCCTACTGATATGAGACTGT 
CAATNCTGNCAGGGCAC 

SEQIDNO400 

TTNTTNGGCTCGTCAGGGNGATTCTTCCTGCNTATGCTGATNATGAGTTGACCGATGTTCAN 
TGTTNNNTAGANCTGNCCNAGTCCNGGCAATGTNNCAAGTATATAGTGGCACTGCNCGGTNT 
TATGNCAACATCAATNCTGCGAAAAGCTTCACC 

SEQIDNO401 

NNNCACTNCTAAAGCNCTCTCCTAANGACCCCCAAGAGGANGCNTNTACTAGACATNCNACT 
CAGGCGNGATCCGCANNCCTGANCCGCGTATAGCTGGTATGATNGGNCANCCAAGGATTNTG 
GNNTACGAGGGCCGTTANGTGNGANANGCACAATGNNGGACAANANNTGNACCTNANGNGNN 
ACAACNCAACCCAAAGGCTAACTATGCGAACCAGACACACCTACTAACGCTCTACTATGTGN 
CACAAGCTGTGCGGTACGACAAGGC 

SEQIDNO402 

NGGCGTGGTGGCTGNAANGGGTCTNANGNTGCC 
SEQIDNO4 03 

TTCTCAGGNAGGCGGGGGTGNCATCNCTGAACACCANAGGCAGNTNNCC 
SEQIDNO4 04 

AACCTCTTTTCTAGNAACCACTCTCTNAATNTGTGGTNGGCGNTTNCA 
SEQIDNO4 05 

ACCNCNANNCNTCNGAGGGANANGCCNACNTNNTGGCNGTGGGCCCGGAANTGTNCNAATAT 
AA 

SEQIDNO406 

TGCCGGGGTTNTCNNACAAGAATGCCNNNCNCTGNNNCGTGTNTGTCTGNNCNCATATGCNG 
GANANGNNCNTGNCCNAAANNNGNCATNGTGCCTTNCAGTAGNATNANCNGATCANCTNTNA 
GAGTNNCCNNNNCAGGNNNNCNNCAGNTNGNTAGTGTNTNTGCTNTNGATNTGACCTTACTA 
TAAANATGAANCGGCACNACCATAAGGTATAAATGTAGGCACANTGCTTGCTCTATA 
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SEQIDNO4 07 

CTCGGTGAATGCACCATCCTCANTTCAAAGTGGTTGCTATGGTNTANCAGACANCATATCGG 
TNACANNNTNCGAATTG7^ACGAAGAATTTGGNGGTAAACTNTGTCAGCAGAGCATGAATGCT 
GGTTTGTCTAGTGGAGTTGAGGTTATTGATGTTTNTACTCCTCCATGCTACAAGGTAAGTGG 
AGACAGCAAGAA7^AGAAGACTTTCTACGGCTTGTCTTGAAATTATTGATTTGACAGACTCAC 
CTATTTTTGTCTGATGTAACTAATA 

SEQIDNO408 

TGANAGAATGGGTTCTANTNAGGAACNATGTNTTGTA 
SEQIDNO4 0 9 

ACTCNCAGTTGNGNGGTGCGNAGTAAACAACTAACAAGANTGCGNAAGCATTCANGAGGACC 
CACTGTANGCTTATN NC ATCTNGATCAAAATCAGAATGAAGTTATTTCTACTCTTG 

SEQIDNO410 

TGGNAGCGCCGCGTAGCGAMAGGNACTATAGCCTGGGGTNGTATAGACACNTATNGGCTGGC 
ACANCTTCTNACA 

SEQIDN0411 

TNNCATTGAATNGCCCTACATNTACCAATNTGNAATCNACTGATACTTCTCAAAACATATCA 
NTGNCTTGCCCACTTCATTACGGGNTTGTATGANAANCCA 

SEQIDN0412 

TTCCCGGCCTGGTTNCCCTACTNATACTCNACCATACCCNAGAAACCCNT7\ACCTAATTCTT 
CATTNNCTCTCCNCATATCATCNTCAAATACTCTNTNCACANATTCGTTCCTTCTACAACTC 
CATCACTTTNTCCCTCTCGCCACCGTTCCAAGTATTGCACATGGGTGANAGCTGNTTNATGN 
TCTGTNGCTGNGACAGATGAACAACACCATATCGCNAGTAATGGACTAGTACACAAAGAATA 
TGCTGNCC 

SEQIDN0413 

CTNANTGGCGNNATCAGTGCTCACA 
SEQIDN0414 

GTTCTTTTNGCAACTTTGATCGGGAAAGGGCTCNCA 
SEQIDN0415 

NTACTTCTGTTTTCTTTTTGTGTCAAATATTGTTTGAACTCTGGGTTTTCTACCACGTGCCA 
CGGTACCACTGA 

SEQIDN0416 

TGCAATAATGAACAAAGCAAGATATCAGTAGTGATATCTTTGTTTTAGAGCATCTTTGTTTA 
GCTGCTNTCCACTANCTACAAAATTGAATATTGCAACATTTGTAACCTTATTTTTATCTTGG 
CAA 

SEQIDN0417 

TCCCTTGTTTNATGGAGCCGATTACTTTATGAGAATGCTCAGAAACTTCAAGCAATCGAGAC 
AGATAATCGCAGGCAACGAGCAGCTCTGGTGACCTTACAGGNNAAGGTAGATGCTGTTGCTT 
ACCCAAGAGGAACTCTGGGTGAAAAATACGTGCATACTTCCA 

SEQIDN0418 
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ACCCCNNTNTAAAGGGGCCAAAGGNANAANCTGCAATCATTATTCGATTGAAACAATCCTGC 
GATNNANACNNGANANNCTGANANATGNCTNAANNNAAANATTTGTGCTGANNGGGGTGCTN 
TTCNNCATGAGGANTANATNNTNNCANCNNCTNAAGCTTCTTTCCATACTGGA 

SEQIDN0419 

TTGCGTGGCAGTTNGGGGCANAGGCACTGGAGACAAGGGCNACTCCAA 



SEQIDNO420 

AAGCAACCTTGAATCAGACTCCTCACTGATCTCTCCTTCTCGTCACTGTTTCTGTGTGTGTG 
TGTGTGTGTGT 

SEQIDN0421 

GAATATGGAAGATTCCGAAAAAGTGTCAATAGATGGCAAAAATCACAATGGGCATGCAAAAT 
ATAGTTTCAAGAACACAAATCGGAGGAAGATGTTTGGTCACCCTGAAAAATTTAGTTCAGTG 
GAAACTGCGATGTCTAGAATAAAGAATAAGAGTCATAGACCAGCTGATAGTGATGGAGAGGG 

TGGAATGT 
SEQIDN0422 . 

TNAAANANAATGNATTCCNCTNGGGT 
SEQIDN0423 

GAATTCTTTGGTGTNCATGCGAATTACGCGTTCAGTTCTTATTGGGCTCACGT 
SEQIDN0424 

AGGATACAAAANCGAANCCNNTGNGTGNCTACACTGCNGAACTGCGTCGTTGCAGGGTCTTA 
TTGGGCTCAGT 

SEQIDN04 25 

NAACAATTTGAAATAATATATTTCGTCAATGCAGCTTGCAAGCTGCAGAGAGGAGAGTCATT 
ATAGTAACTTTATAACTTTTGTTTCAGTTTACAAACCTTGTAAATTTTGACCATATTGAAGT 

TCTCCCTTCAG 
SEQIDN04 26 

GCACATTNTCACATCTTTACTAANATAAGAAGATTNCTGTANCATCTACTAAGATATTGCAN 
AATNNTATCAGCNAGAGTGTTGACGCCGC 

SEQIDN0427 

GAGACTTTCAATTGCGTGCNTGCTNTNANCAAGCCGCGAGACANTNCTAATACTNNGACNNG 
CTGGNAATGNGNCATCTNGNNNNNCTANTNAGANNCNNANGCNCACAANGTNNACTGTGTCC 
TTCTGGCTGATGNCTTCCCNAGCATTACGTGNTGTCTGCGGCCTGAATAAGATACTGCCTCT 

GCAAATCC 
SEQIDN0428 

CCTGCTNATTGGANGGAGCACTGAGGGTGGTACTNNTTGCAGGAAAATGCCTGTCNTNNGNA 
CNCAANTNCANGCCCGNNNNGCACGANGTNGATGCGGNACNANNGCNGCNTNATATCTGNNN 
NGNATCGNNANGTGTNACACGCNANNGANAGCACCGGNTANNTNTTNNATCCTNTGCCGGTG 
TACCTTTGANNTNANANTCCTCNTGTTACCNGANGNCANGTGCTTCTNCTNAGCTTGNTANT 

TGAANTGGNGTGAGAATGAATGACCAGCNGCT 
SEQIDN0429 
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NAGNTNTAATTACTCGGNC 
SEQIDNO430 

TTTTANNCAGTATAANTNCCTNCCCCTTAAACCCCCCACTGGAC 
SEQIDN0431 

TGGNACTTTCTNCTCTTCAAAAGCTTTGACTCTCT 
SEQIDN04 32 

TGTGCGCNNNGTGNATGTATATGTGGTCTNGGCTCTNAGNCTGNCT 
SEQIDN04 33 

TGCAGCTTGGGGAAGACCAGGATATGAGCGNCGGAGTGAGCCACTCCATAT 
SEQIDNG434 

GACAACACCATCAGGTACAATGGCCAAGTCGCAGGCACTGGGAGACGTACCAAGTTAGGAT 
SEQIDN0435 

TGAACTAGANATGTCATTCTATAGCNAGTATTCAGCCNGTGCTGTGTNTTANCATAATATNA 
AGAATNTTTCTNACTTACGTGCAGGGGAT 

SEQIDN0436 

NATAGAGGAGGACCCATCTGACTCCCGTCTTCTTCTTCATTAGAAATGGGAATCAACATCCA 
CGAACAAAAATGCTATCGCTAT 

SEQIDN04 37 

TCAGCCTCCCGGCTTTAACCTACTGNGGGNACAGNATGTNGGAAATNCCNGCNAAGCTGGNT 
GGNT 

SEQIDN0438 

GGNNATGCNCATTGGAAACACNCGAATGAAACGTTTCTNTGCGAAAGTACTCACCAACGAGT 
GCCATTGGAAAGATTTCTATATTGTTATGGAACGCCTAGANNNCAATACAGTGNNACGCAGC 
ATCT 

SEQIDN0439 

CATCTCGCAATGTNATCCAGNGTNAGCTAACNG 
SEQIDNO4 40 

TATAATCCNGCACTCNCAGGANCGCAAATAGNTGTGNNTGATGGTTATTNTNGTTATG 
SEQIDN0441 

ACCNAAGATCCCCCNNTNAAACACCCAATCCCCCCTNTCCGGCAATGAAGCTGCCGGAGCTG 
ACATTGATCTGGCCGATGTTTTCGCCAAGTACTTGAACCAAGGTACAACAAATGATAATGAT 
CATGATCAAGATAATATTCTTCAAGAATCTCCCTTGGCTGATCAAGATTATTGTTCTATTGG 
AGCAAGCTTATCAAATTCTCCTTCATTAGATAGCTTGG 

SEQIDN0442 

ACCANAACNCAANNGAAAGGGCCCCTACTTATAGNNNCCAAGGAGGAGNACAAGTTACTGAT 
TGG 
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SEQIDN04 43 m ^, m/ _ 
CTATTAAATACCTCCGGGTTTTAAANACCACNCGNCTATATTACCGGTTCCGAANCATTGTG 

CNG 

SEQIDN04 4 4 

NCNCGGAAAGGCCCCCCTTNGTGGGGNAAACGACCCGGACTCTCNGGCNGCCC 
SEQIDN044 5 

GTAAGGGTAAGGTCTTCGCTACAACTACAGTCGTTTGGTGGGTAACAACCATCAATAACATT 
ATCATCCTTCTCAATCTTAGCCG 

SEQIDN04 4 6 

TCTNTGGNAAAGCCCNTGAGANATTGGGAAAACTNAACAAACNGNTAAGCAGCAGGAGANCC 
NACANGNNNAGNGAGGCCATTTTTTTNCGACANCNGNGATAACAAAAGGAAGCAGGNGGCAA 

ATTCGAGCTCAGACACNGAAAACCAGNNTCTNA 
SEQIDN0447 

ACTGGCNTNTGCNAGCGTTAGGTTGCTGGTTGTCCTTTNCTTTTNCACTATNNTTTTTGNGC 
TGTNNNTCTTCACCGTTTAGGGANCATTACCCAGTTNCAAAANCAGCTCNGTNACATCCGNC 



CTCGGCATAATCGTGCGTTATATCGCTGGTAGTCCGAAACATTCACAAGATTATTTTTCTGC 
TGATGCTCoGCATATTATATGGATTCTTTATTCATCGATATTGGCACTTGATATTTTCTGAG 

TCG 

SEQIDN04 4 9 
NCCGCAGAGTCCCTGCAGC 

SEQIDNO450 

GGTNTTGGANCTCCATTCTCTATTAGCCNG 

AAGGTGANGTCNCAAAGANNTGACCGGGGCCTGNNTNTGNTCNGNNNACAGGCATANCNGNA 
GACNGAAGCGANGANGACTNAAG 

CANGTGCAAGANTGTTCNTCGAATATTTTTGTATTATATANGCAAATAGTAACCCCACACCT 
ACTAGTTGTTTCTAATTTTCATTTTCTTTTCATTTGTTACTGTTTCGATTTTTTTCTTACCA 

TGTTGGATAAATAATGTGTTGACTATAA 

TCNNTGAGCTNNNTTGCAGCTTCTAGCNGANCTTTNTTTGCAGCGTCTNGCAGNNGNTTTNT 
NNGCCATGNTTGTTNTTCCTNTNCATAGCCCNGTGTATTTTTGGCTATGANCCTGCTCTAGT 
NTNCATCTGCCTTCAGCGTGAGCCTNGTCAACTACATTNTTCTTGGAA 

NTGNGGAGCATGAGTTTATTGCGTTTGATGGTTCACATGCTAAGTCTGAATACATTTACACC 
GTTTTAGATAACCTAGTCGGTCAAANACAACACATTACTATTTTTCCAGATGCTGATTCTTT 
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AGTTCTTGAGAATAGCTGAAAGTAATCAGAGTTTAGATATGCTGAACTTCCAATACAGCCTT 
AG 

SEQIDN04 55 

GATGGCAAAGCAACATTGNACAGGNTGAGGACTACTAGAATATTANANGCTNNTATTGGGTA 
GGNCATACGTTGGTNCTGTGAAAGGGAATCAATGCCNTGNTNTNNCTNGCNNGANNTNGAGC 
NTNNNGGNGCACAAATGNNCTATAANNAGCCCTNTNATGNAGGNGGAGNNCACAAGNGNAGG 
ANGTGATGCCNANCTGACCTAGCTTGTGTAACACAGGNTCATTGANAG 



SEQIDN04 56 

CAAAGAGTGAGGAAAAATGGAAACTGATTGCGTGGTGCTACCGTTTCACACGGTATACATGA 
AAAGAAATCAAGTCAGGTATTTTGACAGTGAGGATTCATAACAAACAACAAGATCATATACT 
CTGAAGTAGCCGAATCCAGGGAGCTGTTGATCTGATCTCGATCCCCAGCAGCGTGCAGGTGA 
CTAACAAAGCTAAAAACCAACTCTATTCAAGAGCTGGAGGTGCTTCAACATAATAAGTAAGG 

GCTGTTCATTCTTGATTCTTTCAATTAG 
SEQIDN04 57 

CTTTGCCACATTCTCGGCGNCACTNGTAAGTAG 
SEQIDN04 58 

TGNGCANANAACANAGGACTNAGGCAAGCGNTANTATGGGGANNGGANCCNANGNGGCNCNT 
CAAGTGNANTC 

SEQIDN04 59 

CCCCGATGCCTTCAGTAGACAGAAGCTCACTGCTGTTGCACCAATNTNCACCCCGATGACTC 
TGCCAGAGGGCGAACTAGTTGC 

SEQIDNO4 60 

NCCGGAGAAAAGGTCGAAACCGACCGTAGNTAGGACTNAGTTTCTCTTNCNGAAAGANCNTG 
ATCGGGCTCTAGNNCANAACCNNGGNTTTNAATATATAATAGANAAACTTCTTNNGNANGTT 

ATG 

SEQIDN04 61 

TGAGGANAAAGAAGGNTACNGCNCTTNCCGATGNACACNCAGNAGGATGANCNATNNNACNG 
ACTCTCNATGCTGNNCGATGNCCAGAAGGTGAGCAACTGGAAGANTTTCTTCTGTTTTTNGT 
NCTTACATATNTGAANANNAATCANNNAAGTANGANCACTAAAACNAACCCATANTGGTCCA 

TAANCTNTNNNCCTN 
SEQIDN04 62 

GGGTAATTCAACAGTGTAGATTTTTTTCTAGCTTTTGTAGCAAATGAATTTTTTTGATCTGT 
TGTTGTACTGTATCCAAAAACAAAAATGTTGTTCAATGAAAGATGAAC 

SEQIDN04 63 

TCAANGAAGCTCTCACCAGTCTCCATTAGTAGAGTCTATAATTATGC 
SEQIDN04 64 

TTTGANNNCCCAANGAGNANCNCGNTGAAAAAGGNCCTGATGAATTCACCACCAATGCCTCA 
CAATCTTTGTGGNGGACTAAANTGTTTTTGCCTTTTNTTGAAAAAGCCTTTGCTCAGCG 
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SEQIDN04 65 

NGTTGNAAACATGCNGCCNTCNGGGTCTATCCAGGAATGCGATTCTGCCAGATGCGATTCCA 
CACGCTAGTCGGAAAAGTCGATAGNTATAAAAAGAAGGGCAACTATCAGGGTGAGCTCGCGG 
AAGGACCTGTTCCTTCTCGTTCCTGGAAAATGTTTGAAGACGAGAGCGTGC 

SEQIDN04 66 

GGCTGCCNTCAGTCCACCCGGAGACCCAAGGTAGACCTGCAGGCGTTCGCGGGGTCTGGCGT 
CTCCCTCTATCTCTATTACCTGTTTCATTTTCTTTCGTTCAAAAACAGTTTATTGTATTTTC 
TTCAGGCCTTGTTTGTAGTGACTCTTAGATAGTATGTGACACTATGACACCAGATTTTGGGT 
ATTGAGGTTTTGAAAGCTGTAATAGATATAGTCTTGAGTTATAAAATTTGTTGATTTCCGC 

SEQIDN04 67 

ACATCTAAAGACGGCAAAGTTCAAGAGACTTCAGCTCTGGCATCTGTAAGTGGGATATCAAA 
GCGCATTTCTGAAACCCCTCAACGAGAAAATAGAAGGACTAGAACAAAGCTTGACCAACCAT 
TCTGTAGAAGCGACACAGAAAAGGGTGAATTGCTATCACACTCAGGAAATTTCGAATCAATA 
AACGAGAATGGAAACAGAACATGTTCCCGTACTGTATTTTCTCCTTTCAGC 

SEQIDN04 68 

TANGCANTTTTTNATNGTCGCNTGTANAAGCCNCAANTCNGATCGGNNCCAACCTTCTGAG 
SEQIDN04 69 

TGTANCTTCTTNNGCTCNTCNGNTGGNTGGGCAGTCTGNANTNATCAGCTGNCTTCC 
SEQIDNO470 

GAACAGNAGAANNGGAAGNATANGGAAGNCGAAGGAGTGAGCACAACGGCACCACCATGNCT 
CGN 

SEQIDN04 71 

TTTTTGCTAGGGATGGTTGGACNNGTGANTTTTGNATGTGAGTGCNTCTATCNTTTAGCANT 
TCNATNAACTTNCCCNCGGAAGGNNTTATNCGNGCNGAGCNTGGNNCNATATTTTGT 

SEQIDN0472 

TGGGGGCAAGCACCNGCGGCGGAGNGGAGGAGNANGTGNNGGCTTNNCAGNNNANC 
SEQIDN047 3 

TNCTTTCAAGAAATCNATGGTGATGAAAATCTTTTTGNNGNTNCGANATGAGGATTCATTTG 
GAGNTAGACAATTACCAATTTTNCTTTGCCTTCTGTAATAA 

SEQIDN047 4 

TTATTACTGAGCTTCATTTCTCCTGCTTTCAATCATATGCATAGCATGTAACACTTAGTTTG 
TTTCTAGAAAGATTCTGATTAGTATATCTATCAACGAATAGGATGTAACTAAAATCTGGAAT 

ATGTTAGTTTA 
SEQIDN047 5 

CATTATGCGGANTTACAGGATNANTACAACGACTNATCTGANAAGCATANNTTGATCTTGCA 
GGNNNTACANGATGTNAANNTGGNTGCAGCAAAAGCAGGAAGAAAAGGTCATGGTGCTNNTT 
TNGCCAANANTCTCNCTGCGGAGCTCTCANCTTTGAGAGTGGAAAGGGAGAGGGAGAGGGAA 

ATGCTGAAAAAGGAGAATAGAAGCCTTANAGCTCAAC 
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SEQIDN047 6 

TATCN CAAN T AC TGGAC 

SEQIDN047 7 

NTTAGGTTAAGTACTTTATTTTGAAC 
SEQIDN047 8 

CCAAGGAAAAAGGAAATCTTGATAAAGGACTTTTGGAATGTTGTTTGC 



SEQIDN047 9 

AGNANCCCTGTTGTTTCATCGGATTCGGCTACTGCCTCATCAGAGTTGCTCTCAGATTTCGA 
CCGGAGTTTTCGTTTTCTGATGGATTTCA 

SEQIDNO4 80 

TCGGNATATAATATCACCGCAAATGACCTCGACTCTCAAATGGCGACCTTGACCGCGAAACT 
ACAATGATTCAAACTCGAAAAATGCTCAATGATGTTCAACCTGCTTTA 

SEQIDN04 81 

GCCCCTTNAACAGCAGCAAAAAGGACAGCAGTCAATTCCCCTTTTCCCTAAGACTGCCAATG 
CCTAGTCAATCCATCATCTATCTAATCGGAAGCAGAAAATACCAAGGCTTCCAGAACACCAG 
AGCATTGTCACTGCAACTTGGTGGGCATTTTCCATTTAGAACTGACATCTGTTGAGTGAAAA 
TTTTATAGCGCACTCTTTGCACATCTTACTGGTCCAATAATGTTCTTCCAATTTGATGCTGT 
TTCTATGCTAATCCAAGACCTGTTTCCCGTCTCCT 

SEQIDN04 82 

AGACGCTGTNAAGTAATGAATTTCTTGAGGACGCTCATCGAAAGGACC 
SEQIDN04 83 

TGCCCTTTTNCCAGCCGTGTGTTGNTATTTTCGTCACAAAGNTTATCACAGGTCTCAAAGAT 
CACCAATNAAGAGC 

SEQIDN04 8 4 

TTCCCGCTNTANACGCCCTTATTCGAGTTTGAGGATCTGTCNAGGTCGAGTTTACGGCGAGT 
CAAGTTGTAATCTTGTTGTTTTGACAACGAGTCGATGTTTTTAGTCAAGTAACNCAATACCA 

AAGGAAATGGNC 
SEQIDN04 85 

CTANCGGNAAATCTCCTTCTTCACAAACGAAACCCTAGCAAAACTCCATCTNCATATCAGGN 
CGTTTCAACACTAGAGACCAAAGGAATGTCTCTTCAGCCAAGAGTCATGCCCTCCCATCCGT 

TCTGCTTCTTCACCATCTTCA 
SEQIDN04 8 6 

CTCCTNCTTTTATTTTACCGNTAGCTGATATTGTTGCTTTGATTGGCTTTCTAAAAATTGTA 
AAATGCATATTTACGCTTGAATTTTCAGAGATGTATTTTGGGTGATTGCTTTGTTTATTTTG 

AGAAGTAGAGATATTGAATTCCACC 
SEQIDN04 87 
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AGGAAAATNGTGAGAGCAAAATAAATGAGAGAACGAGGAAGAAACAGATATGGATATGAGAA 
AACGATNCGCTTTTCTTCTTTCCCATTCACCTGAAACCAAAAACACCTCTCTCATTTTAGCT 
ACTGAAACAATCACCAAATGTCACCTAAACAACCAGAAAACCTCCA 

SEQIDN0488 

ANNCCCNNTTTGAGGGANNNNGGCTGGGNCTGATGNGTGTGATGCTACGNACTTANGANNCN 
ATGCNGAAAAAANGTATATCTACGTNGGANGGCCNTTGNTNCCTGGNGGCGNAGATGNCGCN 
ATTTGTACTTAGACACATTTCAAAGCATGTTGGCNAANGGAGATTGNGAAANTNTTGNTGTN 

AAANTTAGTCNTNAGNGTTACC 



SEQIDN0489 

TNCCCGGTTNGTTAAGNGACTTCAGCTCTGGCATCTGTAAGTGGGATATCAAAGCGCATTTC 
TGAAACCCCTCAACGAGAAAATAGAAGGACTAGAACAAAGCTTGACCAACCATTCTGTAGAA 
GCGACACAGAAAAGGGTGAATTGCTATCACACTCAGGAAATTTCGAATCAATAAACGAGAAT 
GGAAACAGAACATGTTCCCGTACTGTATTTTCTCCTTTCAGC 

SEQIDNO4 90 

TCAANTGANAGGTGTGGGAAGAAATGAAGAATTGTTGATGGCTTATTTTGGGAAAAGCCTTA 
CAGGAGTAGCTTCCGAATGGTTTATGGATCAAGACACGTCTTGTCAAACAGTTCCAATACAA 
CATTGACATTGCCCCAGACCGCAATTCCCTTTCAAACTTGAAGAAGAAACCAACTGAAAGTT 

TCAGGGAATATGCCA 
SEQIDN04 91 

GGAANCGGGAATTCTTGATAAAGGGACTTTTGGGAATGGTTGGTTTGGC 
SEQIDN0492 

GCNTTNCGGAATTCCTCTCTCTATATGAGACTGAAAGACTATGTTCAGGAACTTGCTAAATT 
TGAGATTGATACACACAACATTATAA 

SEQIDN04 93 

NNCTGGTAAGAAATAGATGGTGACAGAAAANNTTTNNGGNGTTACGNTNGANGATTCATTAN 
GGGNGANAAANACCAATTTTCCTTTGNCTTCTGTANTAA 

SEQIDN04 94 

GGGCCTTTAGGGAAGGATGCTTTGTTGGCTTATGGTTATGA 
SEQIDN04 95 

TTNNCTCCANTACGGAAACAAGCACCGGCTACCGAGGACTCCNATATGACACGAGAACTTTT 

CAGGTTTGGCGCCCGTT 

SEQIDN04 96 

GGTATGGAAGAGCTCANNCNAAACGNGAGGAANTTTNNGGAAAACAATATGGAGCNTCAACA 
TGGATAGGAAATGTCAAANGCTTGGGCGCT 

SEQIDN04 97 

GTCCGAACACCAAGAGAGAAACCCAGTGCCAATGGAGTTCAATTTTCATACTGAAAAGAGGA 
T T CAT C AT AATC CG C AAT T GAT C T GT 

SEQIDN04 98 
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GAAATACACNATTTCNAGCTGNNCCCTNGAATGGATGCCAANNNTGCTAATGCTNGNCCAAT 
GACNGTATCGANAANANGTCGCACACNAGAATTGAGGCTNACAGGGATATGATTACACCTGT 

TGGAGACGCTT 

SEQIDN0499 m „„„ mit , 
GAACANTGATGNTTTCCTCNNGGNNGGCTAAGGNNTNCNCCNACCCNGACAGGGCNTGGATT 

NNGGTTCTTNTTTCNNCGNGTCCCNNNNAATCTGACTTTGACTACTAAGAATTNCATACGNG 
TGGGGT 



SEQIDNO500 

TTATGTTTCTTGAGTGTTTTCTGTCTGTGAAGGTTTAGCTCACACCAAGTTTTCTTTTCATT 
TGCTAACACCAATGTTCCCACTGAAATGTGGGACAAAAGTAGGAAGCAAAGGGTGAGAGCTG 

CT 

SEQIDNO501 

GCGCCTTTGTNTATAATGCACCTTTTTTCTTCTGAAAATATNCTCCTGATGATCTTGCTTTG 
GCNCTATGAATTCATTATTGTTTGTGNTGAATTGGCTAAACCTAGGGGTACCAACTTTTTAT 

TCCTGAAGTGGTGGAACATTTACCTATCTTGTTT 
SEQIDNO502 

ACTAANCNNNCCCCATAACTNCGNTTAATNTACATCAAACCTGTACTCTCTCCATGTAATGN 
GGTTGTNAGATCACTGTTCTCTATACGAGGCTCATTACATACCGAATATACGACCCTCTTGN 

TTCTCTTTTGGCTGT 
SEQIDNO503 

NACNGCGAGNGATACTCCNAAACNGNAAAAGAACTCCGGAACACGCNTGGAGCANGAGATTT 
TTTTGAGCACACAAGGCGGAGCCAAGCTCTAACAGNCNGCANGAAGGAAGNGATGCATGGTG 
AGAGTACAGGCGAGAACACATGACATCTNTAACATACTCTCACATAANCTNGAAACTGACGT 

GTNNNACAGAACTNAATGCT 
SEQIDNO504 

NATCCTCCCNTCNAAAAGCCCGGGTTGCCAGGGNTTGACGTCTGACCGATTTGCAGAAGTAT 
CATTGAATGTTGCTCGTCATATATCTGCAGACTTGGAGAGGNTTTACCGCAATGTGGGGGGT 
CAGCCGCAGGAACAAGCGCCTTGATTACAGTGATGCTGGTGGATTCTACTGCAGAGATCAAA 
GTCTTCTTTAGCTAGCAGTCCTTTTGATTATTCTTTTGTTATCTTTGAGTTTGTAAGAGTCT 

NCTGNTGTTTTGATCATGNTATTTTGCCTTTTATTT 
SEQIDNO505 

TTNTGGAGAAAGGNGTGTAATGNACATTGTGTGTANGCACAACATGGATTTTGT 
SEQIDNO50 6 

ACCTGGTTGTTCCGANCCACCAAGAGAGANNCCACAGTGCCNNNGGAGTCCANTTTTNATAC 
TGAAAAGAGGATNCATCATAATCCGCCAATTGATCTGT 

SEQIDN0507 

TGTGGCAAAACATGTAAGCGAGCAGCTAATCAACAAGCTTGATTCGGAGATAGAAGCCGCTG 
AAAAAGCTCATGAAGATGAACCATGACATAGCTCAAAGATTACTTAGATATAGTAGTTCAAC 
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CTTACTAATTTTTGTTGCATAGTGCAAATAGACTTCTTGAATGCTTTGTAGAGGTGAACCCA 
AAC T T G T CAT AT C AAT T C TAT AG T G 

SEQIDNO508 

AGGCNCTGCTNCTGGGTCCNACTNTGCTACACAAGNAANAAAANAGCAAGCTCTCGTTGGTT 
TNCTCT 

SEQIDNO509 

GGNTCGGAAATCNCGGATGNAAGNCCCCAAGNCGNANGATNNNANGCGCAGGGGTATAGNAT 
GANANNCCTATGCTATANGGAGCTACAGTAGGCNAGNTTATTGAGGCCTGACATTNCC 



SEQIDNO510 

GNCNCGGTTTNNGCTCCGCNATTGATCGTTACTGTGACTAGACAGAAACCTGNANGTCTTCA 
NACTTTNACAAAAGGAANGNGCTGACAAGGCAACAGGCCTTCCATCCTATGATCACGNAGAA 
TCAACTNTTGGAGCATTTGACAACATTGCGCTATAGCC 

SEQIDN0511 

AANCCCTACTTTATACATGANGTNTGTGAATACTTGTAANGGAAGNATNNNGANNAGNTTGG 
GATGCNAANGTATGTTCTGGTGTTATGCATNCTNCNANTGCTCTTGCTGAAATCCACAACTA 
NAATANTACTTGCACTACATTANGGCTGTNNTTANNCAATNANTAGTTTTTTGCTGATTTGC 
ANCTCCATGTATNGATAGCNGAGNGTNGACAATCNANNATTCCT 

SEQIDN0512 

NANCCCNCTGTAAGCTCNCTNAGGACTAGTNTAAAGGGGGGCAAACANCTGATGAATGCCAA 
CTGAGAT 

SEQIDN0513 

NNCTTTTTTNGTGNNNCATATTNATGTTTNTATNACAAAAGANNTGTNTAA 
SEQIDN0514 

GCCCCGATNTTTTAGGGNNAAACTCTGCATTTNTGAANGGAATGANGTCTATACGCATTGA 
SEQIDN0515 

ATNCNACNNTTGCNATGCNTNGTNCNGGGACTTGAAGCCNNGCAATCNNCTGNGGAATGCCA 
GCTNNGAT 

SEQIDN0516 

CCNGGANGNAGACCCNCTGNTGGCATCAGGNTATACTAGCNTCAACTAGGGAGTGGAGACCC 
TATNTTGACA 

SEQIDN0517 

TCCTNATNTTAGCGGCCNGNNTGCNGTTCTGGTCANTGATGCNACTNTCGGNCNAATATNNT 
G AT GNGT GCGAC ANN GGGA 

SEQIDN0518 

ATGTNCCGANNTTGTTATCCTNGCATGATNTANGGGAATGATNCTCTNNTGTAAATCAAGGT 
GCCGTAGGTAGTTNAGGGACANTNTATATAACATGCNGATATGNGTGTGAT 

SEQIDN0519 
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GCCGCTNGTATTNATCTGTTGAAGAAATTGCTGNTCAGTTTGTTCTGCAGCAGTATGACAAT 
CCACTTTCTAAGAAGCTCAACGATATCA 

SEQIDNO520 

CCTAACTNTAAGGCCGGCAAGTTCAAGACCAGTTTAGCAGACACTTCCAGAAAATCGCTTGA 
TGGGTGAAACTGAGAAGTGAGGCTTACAAGGCAAACCATTTTGCCATACA 

SEQIDN0521 

CCCTCTNTNATGNCCCNNAGCTGCTGTGTTAAAAATAGAGNCCAAGAGCTCATAAGAATNAT 
GTCCGAGGAAGGATTATACTGTGNCAAACAAATCNATANNTTCATNGTATATNGNGNGGGGN 
ANCAGTGCANCAAGTGTGGGGANTGGTTGCTGGAAAATATAGGATCAGA 

SEQIDN0522 

NNTAACAACCCATGNTNTANGCACAACAAGTGGAGCATATNCTAAAAGTTCCGGNGAAGAAC 
TTGAGAAGGAAAGAGAAAGAATGGTACCGAAATGGAGAGCGAGNGGATTT 

SEQIDN0523 

ATGCNNCTTGNNGTAACCTGCCCGACATTTATGCCNTCTNGNTTATGNTTGATGTTGCGTAT 
TCAAGTTATTGACATTTGGCTGAACAATTAGTTCAAGTTATTAGTTAGTATCTAGTATG 

SEQIDN0524 

TGTGCACATGNCTGATNGTGCTTGNTGGNTGTGGNTAAGGATATCGNNGAGCTAGNAGNACC 
NTACTTNGANCCGCTGNCATGATGGTTCGNTNGTNCNNGCTGCTGAGGNAAGACACTGTGTC 
NGCGGGACNCAACTCTCCAGCGCTTTATNAATG 

SEQIDN0525; 

TAAGGGCTGCTGAACACATCACCAATGACTCACAATCTTTGTGGCGGACTAAATTGTTTTTG 
TTTTTCACTGAAAAAGCCTTTGCTCACCG 

SEQIDN0526 

AANTCCCCCTGTAAAACGCCGCGCCAAAACTGGGGANAAAGAGCGGNCCAGCNNCCGATCCA 
NCGNTGAANNNACNGGNNGNGNCANNANNACNNGAGGGNANTTTNNAGG 

SEQIDN0527 

TCTCCAGAATCCTCATCAATGCTCAGTATGTATTAGTTCTTAGTGCCATTTTTTGAGAATGG 
CCAGNTTCAATGTAGGGTATAATTTATTGGCTCTTTTGGTTTGGCATTTGTGG 

SEQIDN0528 

AACGGGACCTTCGATCCAGACCTCAGAAACTCGCCGGAACCGTGACAAAATCCAACAACAAC 
NAACGGCTGAAGCTCTCCTTTCAGAAGTGTCGCTGCTGGTTGTTTTCAGTGAAGCAGGGGTC 

ATTGGTTTGG 
SEQIDN052 9 

NTGAGCNCAATTNCTGCCAAGGNCNGNACGGNCGATGNTGAACTGAGNCCNAGAGGNAGCNN 
GCACTTACCCTTATNTNGGGGANGNNGAGGTATACAAGGTATTTTAGTATGGTATTCTTTGG 
AATCATTTCCGCTCNGNCCTAGTTTGTTGNTTCCTG 

SEQIDNO530 

CGTTGGAAANCCGTGANGNNTNGGGANANNNNNNCCANAANAAGTCGCCTAGAGGNGACCGA 
NCGNGTAANCAACCTTT 
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SEQIDN0531 

ACGCNNCTNGTNNATNAGCCACTGAACCNAAANNNTNANCTCCGCACGATGCTGACGGCGAC 
GGNTACG 

SEQIDNC-532 

TCTTTNGAAAGNCCCTTGCATTTTNGNANAGGNNNCTTTNGCTTAGNCTTAGCAAGCTGNTG 
GGGAGAGTGGTCAANTNTTTNGNCAACANCTNAGCATNCACATGC 

SEQIDN0533 

ANTCCCCTGTNTTCTTGNTCACCNGTGTGGAGGNTGNACTGCTNCNTGGACAGGNCACAGTG 
GNGGACTGACNGTTGNNACAGCCNTATTGNGAGCG 

SEQIDN0534 

TAGCAAGGAAAGGGCTCTAATTCTTGCTCGACTCCTTGGGCGGCNTA 
SEQIDN0535 

AAATCNCCGATNNCNAATACCNAAGGAACATCAACAAANGACNTCTTACTATGAATCTTTTG 
TTTGATGTTTAGAGCTTATTTATTCTTATGATGTTGATGATGATNCTTTAGGCATCAAACTT 
CATACTTATATCTTTGTTATTGTATCTGGATGTTCAACTTCTAAGTGTTATGTTGTTTTTTA 

GTCTTTGAG 
SEQIDN0536 

NANCCCCCNTCNAACAAACCCrtTGCTGTACCCATTTNACCGNTTGCAAAAGACATGAGCCTG 
NNGGAAAAAATTTACGATTCTATCCTTGTGATGGTGAAAGTNTTNATTTATGATAAATCTAC 
CACTTTTGATTGGATTTCACGATCCAAAATAAAGGATGGTGTTGCATACTATAAGATTTTAG 

TTTGGAGATCGGTTTCCCTCTTGATC 
SEQIDN0537 

NACCCCAATNNAACAAGCCCGGGTACCGAGNNTCCNATATGATCGAGAACTTTTCAGGTTTG 
GTGCCCGANNTTAGGTTNCTCTTCTGTCTCGGCAATGGCTTTAATGGCCTTCAGTGCCAGAT 
CAAATTCCTCATCTTCACATATCATTCCGATTACTGGCCCATTGATGTGAGTAGGCAGAGAA 
TTGTTCATCATATTANGGGCCTCTTCATCCCTAAACATTATTTCCTTGACATTGATAAGGTC 

TTCCACTGCTCTC 
SEQIDN0538 

TTATNTATATTGTTAGACNTTGGAGTCTGAAATTAGNGNTGTTTGGGNTGTACGC 
SEQIDN0539 

GNAGAAATCNAATCNAAGTAGAGGAAGGGCGATACTGGGAAGGGGGGCCTTAGCN 
SEQIDNO540 

GGGNATGTCAAGTANGACANTATGGNCGANNCTNGAGCGTGCACNATGTCTATTNCAGCANC 
ACATTGANGATANCTGAGGANTGTCGCCAC 

SEQIDN0541 

AAGTTNNGCGANTATCCTTCGCTGAGTNTAAATCTATACANTCTTGAATCCTNATTACACTG 
TTAGAGAGATNATGAAAAAAGGACCTNTGAATCNAANNNCCTACTATTTTGCTTCGCCTTTA 

CC 

SEQIDN0542 
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CNNTNNACATTTAACAAGTGAGAGTTTGAAGCCCTTTCAACTTGCGCATGTGAAAGCATTGA 
ATCTTGCAAAAGGGGAATGCAGGATGATAAAGAAACTCTGNCATCTGTATGGAATAAAGCTA 
TTGAAATGTGCGAATCCAATTCACTTGCAANCTTTTTGAGAAGACAAGGGAAGTTGTCATCA 

ATTCGTC 
SEQIDN054 3 

CNCTTTGAAGNNCCACCATCGTACANGGGANAANACGGCNACCANAATCCGGNCAAATTCNG 
GNGNNCTNCCNGAACNCNTNTTTTTNTTTGGGTGCCACCATCGNACCGGNCAC 

SEQIDN0544 ; BSTC4-34-185 

CTGNCAAACCCNGGGNAGTCAGGNAAACGTCCANCATGGATCTGGATCNNGGCACAGNGAAG 
GCAACGCNANCGACMTAGNNACNNNANGACTGTATNAAACANAGNCNGGANTNATACTGANN 

NCANNNANNAGNNTANGAAGNTTCANGGCNC 
SEQIDN054 5 

TTGNNGTNGNAGGNGGAACGNAGGGCAGTTTNNTTCCNAGGGANCACCANCNANNNNGNTNN 
TNNNNNAANNTTTTTTGNTATANNCACACGGANNTNNNNACNANCGAGGGGGGNTTTTTTCT 
ACANTNNATTNCGTGGGNNANAATCAAACGATGANNNCNGNGNNTNCNGNGGANATGNNCGA 
CNNGNNTANNGNTCGACCNCNACCACNNNACNGGAGNNGNNGANNGTCGNNNCTCATTAANG 
AGAGNTTAANCNGAGTGNAGTNAATNACGNCNANANNGANATNTANNTTTTNNNCNNGGNCN 
NNTANNTANNNTNACNTANNACNNNNGTATNNTNCGGNNGCNTTCCCANNNNNNTNTANNNC 
TNNNTCGAATAAGANNCNCGGNCANGNNCNANTCCCNGCTNNNCAAAACACGNNAGNGGAGG 
GTCCGCGNAGGCAGTGAATCCCGTGATTNANCTACAAGTGCCTTGNGTGCAGNTGNCAANAA 
CAGGAAATACTTNTGGAATAAGTGATGCATNCAGAAATGCTACTTCTGGCTCCAAAGTTGCT 

GACTGCA 
SEQIDN0546 

GAGGTACATAGCAGCTACCAGGCAGTGATTCAAAGTAGAGCTGCATTCCGTTCGTAGGCCTT 
AGAATAACTTCCTAGTTCCTATATACTGTTTCCATTTTATTTCAGACAGTATTGTAATTCCT 
TTCCAAATATTGTATTTAGTATAATCCCGAAGCTCATGTACTTGTGACTTCACATATTGGGA 
TATTCGCGTTAGATGTTGGTTTTAGACTTATTGTGTTTGTATCAGAATTGCCTTTACGTTTT 

GTTA 

SEQIDN0547 

CTCTTTGGAAAGCCCTCATNGNGTGAGAANACNANGCGGNAAANNCTNTTGNNACGCCNATT 
ACTCAGGACNCATCATTTTTTTCNNNNNCACGCTANAAGGGGGACTATNNGGCCTAAGGANA 
TNCAGGNGGNNANGCGTANTACGGGAGAAAGGGC 

SEQIDN054 8 

CCTTTNGAGGCGGCATGGATGTAGCAGGGAAAGGCTCTAATTCTTGCTCGACTCCTTGGGCG 
GCNTA 

SEQIDN054 9 

CCNTTGNTGAGCCTATCTNNGTTCCGAAANTGAAACCGACGCTAACTTTCTCCACTAGTCNG 
CCTTTCAGTA 

SEQIDNO550 

CTTTGGNAAGACCGCGAAGTTGAAGGACAGGGAGAGATGANGNGCGNCTCCTTAGGGNACGA 
TCCCTANGNCNNACCGCNNTCACACAGNGTNTGGGGTA 
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SEQIDN0551 

GTTNATNATGCGATTCTTTTTCTGCCTANGGTGGNAGNGACCAAGGAATTGCAGGACCAATT 
TTTTTTGGGTTATNTATCCCTGCTCTAAGGGCACTTCATTGGTATAGGTTGNAAGTGTAAGG 

NNTATTGTTGGCTGGCTA 



SEQIDN0552 

GAANNCCCNGACNNATTTGGGAAAACCACCTGANGAAGAAANGATATGTNGCATNTAAAGNT 
GACTTATGAGTANNAGGCTANGATNTGTTACCANACCCGNGNTGGTAATCNNAGNACTATAT 
NGAACATNTTANTTGNACCTTCTNANTACATNANCNGNTATGAGNACCANTATTACNCNGNA 
CTTNATTNACANNTGCGNNGNNAGGANATTANNGGTGNCNCTNGATCGANTTCTGACTCATA 
NTNTNACNNANCNAATGNACNNNTCNAANGTNNTNANATNATNNNTCNCGTGAATCGAGNTT 
TAGCTATNGCNGCNNACCACGTGAAGAAGAAATGATTTGTTGCACGTAAAGCTGACTTATGA 
GACNGAGGTTATGATATGTTACCATACCCGAGTTTGTAATCTTCGCACTATATTGAACATCT 
AGTTGTAGCAGTTTTTTTTATCATCTGCTATTTGTGCATTA 

SEQIDN0553 

TNAAAAANAATGGATAGCACTAACACAAAGGCGGCAAGTTCAAGACCAGCTTTGCAGACACT 
TCCAGAAAATCGCTTGATGGGTGAAACTGAGAAGTGAGGCTTACAAGGCAAACCATTTTGCC 

ATACA 
SEQIDN0554 

TGCCCNGTGCNTGGTTGTGGCNAGNGNGCTAGANGANTCCNGANGAGGNGNAGACCGNGAAA 
CCCACCGA 

SEQIDN0555 

CTTTGGAAGGGCCNNAAGCTNNNGGGANTCNGCNATAGGGGAATNAACCNATGTGCATGCAA 
CAAACAAGCCGNTNNATGTCANGA 

SEQIDN0556 

CTTCGTNNAGANCAGGGATTGTTGNTTTCCAGCGNACGATTCGAGGTTCGGATTNGGNATTT 
CGATGTCTCANTCCANGGGATTGTTGCTTTGTTTAGCCCGA 

SEQIDN0557 

CNTGNCNTNCCGCNGCTGCTNCNGTGANNNCNGCTGCTNTACGGAGCTGATNCTGTNNNTGT 
CAAGGAGGNCGACACAGGTANGNNCCNNCGNGAAAGTGTGTANATGACAATATCAAGATTGT 

NNGGAGA 
SEQIDN0558 

CTTTGGANANTCCGAAGAGATNAGGNAGACGACCCTGATCCTGNAGGCTGAGCAAGAANNNA 
GNNCACAATGAGCNATNGCTANGNNAGCNGACANGCAAACTANCTCNNAANCTNTNCTGGTG 

ATNCCGNT GATC AN GGAGGNAGCCTT CN ACCAGAC AGN CN TG ACAGGA 
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SEQIDN0559 

CNCTTNGNCACAGCCCTATTTGTATTTATGTTTGAATTTTATGACAAAATGGTCGTATTTTT 
CTCA 

SEQIDNO560 

ACGACGCGTANAANATCTGAAGGATACCTATGNNCAANCGAACCAATGCACGGATATCCNTT 
TATAACCCAAATCTTCAGTNGNGAATATCTCTNCAGTTCCTTTTCTATTGC 

SEQIDN0561 

CGAGTTTNATGGCGNNGCGATGTGGACATTCGTTGTGGNGGCCTAATGCTGAAAAGGGNTAT 
TGATATGGCAAGAGGAACCCTCTGCAATGCAGAAATTGANNGTGGCTCC 



SEQIDN0562 

ACNTGACTTCNTNCAACCAGCCATCTATANNANAGGAAA^lATANTNTGAGGATTCCCA 
SEQIDN0563 

CTGTCCNTTTTNTGNGACCTNGTGCNGGCNTNCTCTGANNGNGCCCNGTNAGCGNCCAACTC 
NNATCAAGCTCCTTNCAANTGANTGAGGACATGATGNGGTNATTTACTCGTGANGAAAGGCA 
GCTNATTCCTGACCCNATGGAAGCAGNNAGGAAATCNGCTCCTNGCTNCNNACTGNGCANGG 
NTNNANNGTACTCGNCCATACNGANGTCNCACANNATTGCTANATTGTTNCTAGCA 

SEQIDN0564 

NCTGGCAGTACCAAAGGTCCTATGGATTGTTACTTCNCGCAAAAATCTGGAGATAAGGAAGG 
AAAAAGTGGTAATCCTCAAATTGATGCCAAANCGATTTTGAGGGATCGTGCAATTACAATGT 
TTGCGCGGTGGATGTATGATGCAGGTCTTCCTT 

SEQIDN0565 

GNNGCTAGCAGCTCGGAGTNTNTTGNGGTCCNGCNGAAATTTTTNNTGGNGNACACNGGAAN 
TTGNNNNATNTCTNATGGNGTATGGTAAGAACTNATTTTTTGANATTGANGGNCGANATGTT 
CTTNGGGGGGGNNCCGTCACACCTGTCACTTCATTTCATTTTT 

SEQIDN0566 

AAAGGAAACTAGTTGGAACTTGTT 
SEQIDN0567 

AAGAT GATGAGCAGATTGC AAGGAGGAAGCAC CT 
SEQIDN0568 

ACTACGACTGGCAAAGATCAAGTTGTAGTAACTAATAAATACTCGAGAGAGAACAGTGGAAA 
TCTTTTTGT 

SEQIDN0569 

TNCANGNCGCTGCNCANGTTCCTNGGNAAACAGGCCGNCTTGGGTTGTACTCAGGTACTCAT 
GAAACTTGTATAGNGCTNGTAAGAAGTTTGNGTNGTTCGGT 

SEQIDNO570 

CTGGCNGNACCAAAGGNCCTGNAGCCCGGTANATTCNCCCCTGTAGCTNCANACTTCCTGAN 
TNTACTNTNGATNNNACATTATGGGGNNAGACCACNATNTNNATNNTCNTCAGCTNGTGACT 
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TCATGAGNTNTCTTGGCCATGNNAAGCTAAGACATCAATATGTGAGNGCGNTCACGAGCATA 
TGCCNGAGCAGACATTCATAGAGACTCTNTTATTAGTGG 

SEQIDN0571 

TTTCTGCTGNNTCAGTGAGGTTAGATCGTAATGGAGCACTTTTTATGGAGAACATCAAACAA 
GAAGTTGAAAGTATTGATGCTGATGTAACACCTTCTCGAATACAAACTGCCT 

SEQIDN0572 

TTTGCATGNCTTCGAAGGNCAGTGCTTGNTCTGAACCCGTNNCTTGGACTTGACAACTAGCA 
TCTTCTCTTTGCATGCTGCCCTCATGTATTGCCAATGTAATTTCTCCTCTAGCAAACCATTA 
TGTATTACAAACTATTATTATGATTGTGAATAACTTGTGAAAAGTTCAATCAATCTGAAAGA 
AAGTAATCTCTCT 



SEQIDN0573 

GGNTTCTAATTTCTAAGTTGATGGCTCAACCAAAGATATTTAGTACTGAACTGATTGTACTA 
ATTGTTCTATAAAATTACGGGGTTTAGAT 

SEQIDN0574 

ATATGATGNAGTCCGGAAGATCGAATNTGGGGAAGGTCCTTCTGGGATCAAATAGGCTAGAT 
TTACTTGTTTTTCCTAAAAATGTAATAAGGCCAAGTGCCAGTAGTGACTTATTTTATTATTT 
TAGTGTCGTTTTGGGATTCGTCTATTTTTATATTATGAAATGAAGCATTTATTGGCAT 

SEQIDN0575 

TTTAGCTACGANGGTTCTCTNCGAGATTATATTCTCAACGCTNATGCNCACGCNTTTGCTTC 
TCGTGT 

SEQIDN0576 

CAGNGCTNGAGCTGAACCCGGNGCTNGGACTTGACAACTAGCATCNTCTCTTTGCATGCTGC 
CCTCATNTATTGCCAATGTT^ATTTCTCCNNTAGCANANCATNATGNNNTACAAACTATTATT 
ATGATNGTGAANAACTCAGTGAAACGTTCAANCAATCTGAAAGAAAGTAATCTCTCTTTCCT 

SEQIDN0577 

GGGAGGNAAANTNGCCCTGAAACNTAAGAGGCTGAGACTTGTCATAAAGAAACAAACTNTAT 
TCANGCANGAGAAGAAAGCAGTAAGGAAAATCAGCAAAATAGCAATGAGGTTAGCAAAGTTA 
TTGATAACAACGGAGGGACCAAGGATGTACAACACAAGAAGGAGAACATGAACAAAAGAGCT 
AT G AC C AC T G G AAAAAT T G AGC AG AT CAT G 

SEQIDN0578 

ATACGAAGGTTCAGTGCTAGTAGCTGAACCCCGTTGCTTGGGAATTGATAGTTTGGGTGACA 
G 

SEQIDNG57 9 

AGAAATCTGCCNGGTTTGCATGGATATAAGCAAATGCTCAAGAATGGTGCTTGGGAACAGTG 
CATGTCTGCCTTGGAGCCCTCTGTGAAGGGCAAGCTG 

SEQIDNO580 

GCTCAAGGGGAGGTGGCNCCAGAGNNNAGTGCGNGGTTGGGGNAAAGGGTGCAGATTCTNCN 
AAGGNCCGTCAAAAGAGCCCATGNCC7\ACAATTTACTAATGATTCAC2\AGAGCNNTGGGGGN 
GG 
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SEQIDN0581 

NATGTGTATTCCTGAANNANCTNANTGNNCAATTATTCAACCANTNATTNTACCAAGTTCAN 
TGTTANCCAGANTANNCNTCATTNATCTNTNTACATGCNTCACTAAGATNTTATTTGTAACA 

AGNGGTTTTGTTGGNTGG 
SEQIDNOS82 

GCTCTAAAACCAACCTTTATCAGTCAGAAATCAGCTTTCAAACTCCATAAACACAGCAGTTT 
GGTTTTCTTCACCATCGATTCTATTTTCCGGTCGCGGTTCGTCACATTTTTTGAGTTCAAAG 
CTATCAAACAATTGAATTTTAGACTTATTTGAGGTTTATTTCTCCCTTTCCGCTATTATTTT 

TGG 



SEQIDK058 3 

TCTATCACAATAGAGTCCTTTGCTCGGNGAAGAGATGGGGCACATCAAGCCACATTATCCAC 
TCATCACAATAGTAGAGGCCACACACAGAGAGGAAATGCGCCAAAAGGAGCTGCTGTATATA 
CACATCAAAGTTATAGGAAGCATGCATCAGGAAGAGGAAATGGGCTTGTTGGAGCTGCTCTA 
CATTCTCGTCAGAATAATATGGGCATGGGCAGAGGACAAGTGCCAAATGGTGTTCCTCAACT 
CAATCATCGCAATGTGGGAGGTCAATTTCGCGGACGCGAAGCAAAGAATTCCCATTTGG 

SEQIDN0584 

NGGAAGGNGTTTGGTGTANGGGTGGGGGGATTGGGGTGGACCCCAGGTGGGGGGG 
SEQIDN0585 

AAANGAGGAAAANTAATNTATGGCTANNNACANATGACAAGGACATAAGGTAACTNNGCATT 
CTANCC 

SEQIDN0586 

TTTGGCTGCCNTTTGCTAATCCNTTGCAGTNTNTGTGCATAATNNGAGTAGGGGTATGAAGA 
TGCCCACCTNTTGTTCATTCACTTNAAGGATAATTACAAGCCAACTATGGAATGTGACG 

SEQIDN0587 

AGANTAGAATGTTGTAAGAGTATTGAACTCAAAGCAGTATTGTAAGTTTGTAAGTAAGTTGA 
AAGTATTGAACTAAAGGCTCGAGGTATTCAACTCGAAGTAGTGGTGTAAAAGTATTGAATTG 

GAAGTGCG 
SEQIDN0588 

CCCCTCATTTCACACATTCTTGAAACCAGGTGCACTTGCCCAAATCAGGTACTCCAAAATCT 
CTGCTAAATCAAGGTTGAAAACTGTTCAGTCCCTGTTTGCTATTTATAGTCAACAAATTCCC 

SEQIDN0589 

TTTGGGAANCTTTTGGAAGGTCCCATATACGTNTTTNNCAANTNANCCNGGGGCCTTCTNGG 
TTTTTTTTATGNTTTNATACGTCGGNTTGAGAANATTGNNTGNTTACAAGNAGGTGAGGAAT 

ANATAATATGATTCCTTATCTTCTTTGCG 
SEQIDNO590 

CAGCGGAATGCCACCGAGGCGATACCAGCGATGCTGCACGTGATGGCATGNTCTGCTTCGGC 
GCGACCTGGGGCAGAATAGAGGAATCCGGTATAGCGCTTCTCGCCCAACCGGTACTGTGCGA 

GGAAGCTAATCTTATTCATGCCG 
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SEQIDN0591 

GGCCNGTAGTTGGGCTNGNNACGCNCCNNAGNACCNACTGGCCCNGNNAANGAGNATNAGNT 
NNTCATGCNTTATACNGGNACTNACAACCCACCANCCATGCCATAGCAAAGAAGCGAGNTAT 
AAACACAAGNTCNGGACCTNTGCCTATNCCAATCAAAATTTACAAAGCCACGGNTACAAACT 

NCTAAACG 
SEQIDN0592 

TTTNNNCCTTTCNTNNCATGNTATACGAAGGTCAGTGCTAGAGCTGAACCCGTTGCTTGGAA 
TTGATAGTTTGGGTGACAG 

SEQIDN0593 

CCAGACTCGCGATANCTGNNTNANCTAACANTAGCATTNTGANGANGTACCTGNGACTTNCA 
CATAGCAGCGGTGGGTCGAACAG 

SEQIDN0594 

ATCTATTGGATTTATGCTTTGGNTTCTGCTTCTAAAATATAGAAATTCTGGAGAATTGAAGC 
TCGTTTCTATTCGAGGTTGCAATTCCAGNTCGAAATCATGGNCCATAGCTCGCTCGAGGATT 
GCTTTTTCTTTGGAGATTATTTTGCACTGNACCCGTTGAAAAATTTTCAGNAACAAAGGTCC 

ATCTTCCCCATTGCAACTTCA 
SEQIDN0595 

AGANCCTATGATAACANGATNGGAGGACTCATGGCTNAGGCTTGGCTGGAAACATCGGNGCT 
GGGGCCCACCACTCTGAACCATATCGTNTAGoACGGTCTCCTACTAGCTGGCCTCCACCTCT 
AGCGGTCTGGCCTCCACATATAACGACCTGACCTATACCTNTAGCGGGATGACCCCTACCTC 
GNNCTCCCAGACCCCTACCTNTAAATGGCTTGGCAGGCAATGCCAGAATCATGGCACGGTGA 

ACTNTGNTACTGCGACTGAACACCCA 
SEQIDN0596 

TCNCAGCATCGCAAGTGATTTACTTTGNCTGGNGCCNCCAAGNTGGAAGGANGTTAGCCCTG 
TAATCAAGGCGNTNNTGNCCTTGCCTC 

SEQIDN0597 

CNCANCAACCNTTGTANATATGCNCTTTTTACGCTCGAAATTTTTTAGCTGATTGAAGAGGG 
TNNTCTCCNTCTTGGCAGGTATAAGGGGAAAGAAGCTGCTTATTGTAGCAGCAAGTTAGNGA 

TC 

SEQIDN0598 

GAGACCGTTGGCCGCATAAACAGCTCCANCTGAAAAGGGGAGTAATTGTTTTTTTTCTTCTT 
CTTCTGAAATATATATAGACAAAAGAAAGAAAAATAGGAATGAGAAAAGGGGGAAAAGCATG 

TGTTCCTAGCTATTAGTTTCC 
SEQIDN0599 

CTTGCCGGTCCATTTGCAGGTTGAAGTGGCAGCTTCTGGATCATGAAACGATTGAGTGCAGC 
GTCTGCCAGCATCCATTCCTTG 

SEQIDNO600 

CNCCACGTCTNTGTGCCGNAGCCNCCCCCTCGCNCCAATNCGGGTGTCATTNCANCGNCANC 
GATTTTTACCTACAAGATAGGTGGNTCGGATCGANNCGCNACATTNGATCAGATTTGNCGGT 

GC 
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SEQIDNO601 

TNCNGNCCCNNTTTTGCNCAAANCCTTGAANCTCCAACCACTACCACCCCAAAATACCNACA 
TNNNTNGATTNAGCTCTTCAAGACCTAGCTATTGNTGNCAATTCTACCCCAAAAATCCGGCG 

ACCAAAATCTGGC 
SEQIDNO602 

TCNNNAACACACCCTAACCTTCAACNCCC 
SEQIDNO603 

NCAACAGCTCTAACTGAAAAGGGGAGTAATTGTTTTTTTTCTTCTTCTTCTGAAATATATAT 
AGANAAAAGAAAGAAAAATAGGAATGAGAAAAGGGGGAAAAGCATGTGTTCCTAGCTATTAG 

TTTCC 
SEQIDNO604 

NAGCCCGGAGCTTTTNAATTCTTTCATAACCCAAGGAGAAGAATAGGACTCTTTACCAGTAT 
CATAACCTCTCNATGGGAAATGGAACTTAGATCACGATGTGAACCTACTTATGAGTGGAATT 
TCGTTGACAAGCAAATTCCCCGGGAAAACAAACTTTTCTCCAATTGAGATGCTCTCTTCATT 
TATGGATTCTATGCGAGATTCGGTTAGTGCGAAGTGTGATCCTGGCTNAGAAGGAAAAGCTA 

TATGC 
SEQIDNO605 

GGNCCAAAATCGGNANCATCTCC 
SEQIDNO606 

TNCCCAGCATTCCGCGCTACCANAGAAAAAGATGGATCCACCANAGATNAAACAAGTTATAT 
TGGGTATAGGCATATGACGAANACCAGAGAGACAAGGGCAGTTCTATGAT 

SEQIDNO607 

GACCCTTCCCNCACCGTNTCGNATCTTCGNTTGAAGANTCGAGCNGGACCCCAACCTATGTC 
ANNCCCCCCCAAATCCATACCAGGNATCCANCTGNCCTCCCTTGNGACCAAACCAAGCTTGG 
CTTTGNCCGAATNTAACCAGAAAANCCCANGNCCNAANTCAGGTCCAAGAACCCTAGAAATC 
CGGAATCTGAGGGTTTTGTNNGA 

SEQIDNO608 

TGCTGTTTTCAGGTCTGCTGATTNTGTGACGACGTTAGAAATCTAGTCTCAATCCCACTGTA 
TGTAGTGTAGAGTAAACAGTTTTGTTGGGCAGCTCAAGAGCTGCTGCAGGTATTTGATGTTA 
GTTCCACGGGCTCTCCAAAATCTTGAAGGCCAGATTTGAAGAAATATCCTAAAAATATGTCT 

TCTTATTCG 
SEQIDNO609 

GNAGNGGCGNNCAGCATCNNTNGATCTGAAAGGGAACATGATTGTNTGGTNAAACTCGTAAC 
GGTAATTAATNACCTTGNTANGTCC 

SEQIDNO610 

GGGGTCGGGTTTCCGGCGAGTCAAGGNGTAATCTTGTTGTTCTGACAACGAGTCGATGTNGA 
AAGTCACGTAACTCAATACCAAAGGAAAGGGCN 

SEQIDN0611 
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CCCCGGACGTTTGAATCTGGGCCAGGTCCTTCTGGTACCAAATANGGCTAGATTTATTTACT 
TTTTCCTAAAATGTAATAAGGCCAAATGCCAGNACTGACTTATTTTTTATCATCTTAGTGTC 
GTTTGGGGATTCGTCTATTTTTATATTATGAAATGAAGCATTTATTGGCA 

SEQIDN0612 

CCCCNCAGCTTNGAACATAACCCCCCGAGCATGACTGCTTNTGATTTACTTANCTTATGCAG 
TTTTNNANACGTTCCCACAAGAACACGTTCNTCGTTGNCAAACAGAGATTNGAAGGTTTGTC 
ATGATTCTGTTACTGNAGATGAGAATGCTCATGAGGGCGGGCTCCCTAAGGAAACTGAAGTG 
CATTCCCAAGACATCTCTGTGGATGCGAAAAGCCTCAATTCTGAGAAATTGAAAGCGCCATC 
CATGGAGGAAGAATCATGTCTTACTTATGCCA 



SEQIDN0613 

NGCCTACNGGCACNTCGGCTTNNTACTTNTGTGGATGGCTCCNNGCTAGCCAGTNTNAGANA 
NTAACNGNTGCATCCGNGACNTATNNATGAATTNCCATTGTTGTCNGATGGTNGGTCAGGGC 
ATAACCTGTTANGNTGGANANCATGATGTGCTGTGGATACACAAAGAATGNAGGCAGACATT 
CACAGAGTGCTTTCTCCAATAGCACAAGAAAAGGAACCATCGGTTTNTACACCCAGAGNGGN 
AACCCCNATTGTTTCCAANCNAAGCAGTAAATTCATGGGAAGNCCTTCTTCACAAGCAGGNT 
CATGGAGGCCCAAGCATCCAACAGTTGTTGCAATAAAGAAGCAAATTGTGTGGAGTCCTCTG 
AAGATGAAGGCCATGAGAAGTAGGCAATAGGAAGCCCCTCTCTTC 

SEQIDN0614 

ACAACGGCTAGGTTCCGCGAGTCANCCTGGNAAAGGAGCCTGGNNANNGTANAGANGACCGA 
CAGTNNCGNATACAGNCNCGAGAACGTNA 

SEQIDN0615 

CCAATAGGCTCAAAACGCAACAAAAACCAAAAGAAGAACGAAATTCCCTTGNTTGGATTCAT 
AATCTCAATTGTCTTGTTTTGTCTGGTACGTGAAAATGTTGATA 

SEQIDNO6I6 

GGAAGTAGCTGCCTNCTTGTGNTGAAGGCTTGCNGCTGTCTNCCTTCATTTGTTAGCCTAGT 
AAANNTGGCNTATATNTNCGATGGCCGCTCTCATGTGNTAAGCACNTTTGCTNAACCATTTC 
TATGATAGCATGAGAATGATGATGCTATGAGTTACAATGCTGGGA 

SEQIDN0617 

NGGAGGGGTCCGGNAGATGAATCTGGGAAAGGTCCTTCTGGTACCAAATAGGCTAGATTTAT 
TTACTTTTTCCTAAAATGTAATAAGGCCAAATGCCAGTACTGACTTATTTTTTATCATCTTA 
GTGTCGTTTGGGGATTCGTCTATTTTTATATTATGAAATGAAGCATTTATTGGCA 

SEQIDNO6I8 

CAAGGTTTTGGTCTTTCTTTTTTGGAGATTGGTTGTGCTATCTTAGCTCCA 
SEQIDN0619 

TACCCACNCCACCTCCCGCTGCTGNTCCTTTGNCTTCANCTCATTCNAAGCNTGACNNCACT 
NCCAATGTGTAAAGCTNAGNGGCGTACTCGCT 

SEQIDNO620 
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TNGTTGCTTCTTCTCCACGCCTTCTCCGGCACTACTTCTTCTTNTCCGGTCGAAAATTCGGC 
AGATCCCTCTCATTTTCTGGCTGGGCCGTTCATCTTNCTCAGCACACCACAAACAATCGATC 
TTCTCGACCTCTCAACCATAAAGCCACCATCGAATCCCTCTCATCCGTTTACTCGAACATAC 
TAGTTACAGAACTAAATAAACTTTCAAANTTTTTGCTGTT 

SEQIDN0621 

GCACTACAAGAAGCTGCTGNGGCTTCTTGNAGGGTTTTGTGNGANNATACCACTCTTGATNN 
NTGTTNNCNCCGATGGTTATNGGTTTCANNGGAAGCNTCTTCAAGTCTTACAAATCTTATGA 
GGNAGCAANAGNAGTATTCAATGACTTCCAAAAAGAAAATATTTACAGTGAAGAACAATCTT 
CAAGTTTGTGTATTGATGAAAGTGATATTGGAGCAAGTGTTATGTCATCTGTATTGTTAGCT 
GGAATGTTTGTAGGGATGAAGATTTCAAAAAGTTCTTCAGTTTGATTGTTGATAAGAGTATT 
TTGCTCAATTTTTTATTATNGCTTAGNTTGGGTATTATTAGNTNGATTGTNNAGTTTGANGN 
NATACTGGNTGNCGCATTCAACCTCTGTNGAATNGAGTATTTAGGATGCCNAAGCCNTTATC 
TTTTTGACTCCCNGTTGGNATGNAATAAAANATGTCTGNTGATT 

SEQIDN0622 

ANTACANNTAAAGGTNTTAGCTGCTGACATTTNGAATTGTCGCTCAAGCTGNTGNTTGGATT 
GCTTGTCNCTGAAATTTGNATTTTTGAGTGTTCGAGTNCGATNNCAATTTCAGAAAGTGAAG 

CTACATTNTGTTGAATCTNCTATTG 
SEQIDN0623 

CTAGGCGTGTTAGTCGACAAAGCATAGCCCACGTTCTGTGTTTTTGGATCGCAGTTCATCGT 
CAAATTCTAGGCGTAGTTCTAGTGGTACTAGTTCGAAGCATCCGTACAGTAGCT 

SEQIDN0624 

AGAACCAATCCCCAAATTTTTGGGGTACCCACTCCACCTCCCGCTGCTGCTCCTTTGCCTTC 
AACTCATTTCAAGCATGACAACACTTCCAATGTGTAAAGCTTCGAGGCATACTCGCT 

SEQIDN0625 

CAATTAGCNTGTGCNAGNCANAANAGGGAAGAGAAGNAATNTTTGTATAGCTTCTTGACAAA 
TGTAGGTNTTAGTGATCCTTGNTATTTACTTAT 

SEQIDN0626 

TTGTAATGCTTTGTTATCCACCACTGGTGTCGAACAATGTTCAGTGTTTTCTTCTAATGGTT 
AGTTCAAGTTGTTGTGGATAAATGATTATACTGTGCTCTTCGTAAACATAGGATGCATTTGT 

ACCAT 
SEQIDN0627 

TGGNTGATATCATTATAGATATAGGGCTTCACTCCCTAATCNNTNTTTTTCCAAGGTNTACA 
CAANCCTGATTNTTCNNCT 

SEQIDN0628 

AGGTTGATGAAGAAAATGAAAGACTAATAGTTGATGAAGTATGTGAAGCAATGAACAAGATC 
AATGTTTACAACCGATCATGAGTTTGAAGGAGTAGAAGAAGAGTGTGCTGAATTTGCATTTG 
CCTAAAGGAAACCACTTGCGTTTCCCCGAAGATAACTGAATGAAAAACTTTGTTTTTTTTCC 
GCTTTCTGTGAAGACACCAATAGCTGAGGTGTTTTAGAAAGTATTACATTCTG 

SEQIDN062 9 

NTTCNANCGAACNNTCCATGTGCTCATTNCATGCAATGCTGATGNNNAANNGTGTCCANNNG 
GCCGTTTACNCNTNGG 
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SEQIDNO630 

TGACCNAGGACCAANATTGAAGGAACATCAACAAANGACTTGTTACTATGAATCTTTNGCTT 
GNCGANTAGAGCTTATNTATTCTTATGATGNTGATGATGANGCTNTAGGCATNAAACTTCAT 
ACTAATATCTTTGNAATTGCATCTGGATGTTCAACTTCTAAGAGTTGTGATGGNCTTTAGAN 

TTTGAG 
SEQIDN0631 

TTTCTTCAAGANTGCCAAANNAAGCATGCAATGAGCAACGGTTGTCACACGACATATAGCAC 
TGTCAAGTTACTNACAAAAAGTGAGAAAAAGAAAAATGAGAGAGTCTTACTAGTGAAAACCT 
CCACGGGCACTGTAAGGCGACGGTAAGCAGAGATGAATAAATGAGAGAGACTTGTTGGTGAA 
AACCCCTTGGGAACTACTTGTCGAAAGTGAGTCGTGAAGCTGATGCGAAGAATTGGCATAAA 
CAAGCCTGACTTCAAAGGTCATAAGAATGGTATAAGGGGAAGATTGGATTAGTTTGGTAGAT 

CGGTCG 
SEQIDN0632 

GTGCAGGAGNTGGCCCAAAAGNANGGGAGNTGAATTTACTAATTCTGNTGNTGGC 
SEQIDN0633 

CCACNCCCCCTATTTTCCCCTATANGCCCNTTCTACATTGGCACNTTTCACAAACAAGNACG 
CTNACCCTTTNTTATGTNGGACTCTGTACNC 

SEQIDN0634 

GAGGCNTTNCATTTGANCTTCATTGNACCAACAACTTNACCACCATGGCACACTAGTTCCTT 
GNCGACGGGAAGCACCATGAAAAACGCTGTCCCTCACCACTAAAAGCTCACCGGAAAATAGT 
NGCCGGATAAGCTTCAGCACACCCAGGACCCTTCTCGCATCTCCTTCACACCAGCGACCCCT 

CCCCCCCGGNCG 
SEQIDN0635 

NNNACGNTCTCGAGTNTGNNGCCTTTCTCAAGACTGCCCAAANAAGCATGCNATGNGCAACG 
GTTGTCACACGACATATAGNACTGTCAAGTTACTTACAAAAAGTGAGAAAAGGAANAATGAG 
AGAGTCTTACTAGTGAAAACCTCCACGGGCACTGTAAGGCGACGGTAAGCAGAGATGAATAA 
ATGAGAGAGACTTGTTGGTGAAAACCCCTTGGGAACTACTTGTCGAAAGTGAGTCGTGAAGC 
TGATGCGAAGAATTGGCATAAACAAGCCTGACTTCAAAGGTCATAAGAATGGTATAAGGGGA 

AGATTGGATTAGTTTGGTAGATCGGTCG 
SEQIDN0636 

TCCCCAAANTCTGNTTGAATGAGNGNGCCCANACCAGGACNGCTTNGCCGCTAGACCCGGAC 
AN ACN T CT TT TCG AN AAACN CAT C G AN C AGGGC A 

SEQIDN0637 

TTTGGAAATCGCCCAAGACAATTTCTGGNATCGGGGAAGTTTGNAGAATNNATGCTATTGGC 
ATAANTCAGNAGTTTNNAGATNCGAANCTGCCANTAGACTCGCTAAAGCTGGCGCCTNACNT 



SEQIDN0638 

TGCCCTAAAGCCGGGGAAAATCTNATTGGNGGCTGAAAATGAACCAAAAAAGCTGAAGACAA 
AAGGAATGATCAAAGAZU\AGGTTCGTAAATTATATTGATACANCTCTAGACAGTCTCCA 

SEQIDN0639 
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TCAAAAGGCAAGCAACCCCTTTGGTGGGCATAAGGGTATAAATGCCG 
SEQIDNO64 0 

NCTCTACACAGAAACTCGAAACCTACGCNTGACGGTCACGATTTCAGTANCCTTCCNNCTCC 
TGNGT 

SEQIDN0641 

TGNGGTGGGGGAGCTCGTCACCTGTCTATCAGGACCTTGNGTATACTGCCCAACCTGAAGCT 
ATGCAAATGTCACGACNCCTTAGTCT 

SEQIDN0642 

GATCCCTCNCTCAAATGCATTCTGATCAACTAAATTTGAAAGGCGAGGGCAATCGATGTTAT 
AGAAAGGGGTTTCGTTGGTGAATTTTCTTTGTTCATTTTGCGCAACAGCTTGTTGTCTTGAT 
AGTGAAGGAGTTTATTTTGTTTACAGAATATTAGTC 

SEQIDN064 3 

CCTACATCACCAAAGCTATCATCTATGAGCTGGTGGAAGGATGGAAGCACTCCATGTTTCAC 
ACTGATTGAATCACCCGTCCTACCAAAGCATTGATGTCTTCTTCTTTATGATCACAGGCACC 

CTATTAC 
SEQIDN064 4 

GAAGGCCNTTNCGTTNNACACCAATGAGCCCTTTTCTTCTAAAAAACAAAAACACATTCAAA 
AACCATCCTTAGCAGCAGCAAAAGACCTCTAAAAATAAGTTCAAACCAGCTTTTTCTTTCTC 
CCTAAATAGTATGAAACCCGTCCAAATAAGC 

SEQIDN064 5 

AATCTTTTCACCATCGGCCGCAATAATCGCCTCTGCGGCACGTTCAATCTGGNGTGGGCTCA 
AGAACAACAAGTATTTGGTCTGCGGATACTGCGCTGCTGCTAGTTCTTCTTCGGTGGTTTCC 
TCAGAATCTTTCAGCGCCTTGATGCGCTTTGCATCCACTTCCGCC 

SEQIDN064 6 

GGCTCATATCGATTATGGATCAGANAT.TACCGGAGAAGAAAGATTTTTACCTTTTTAGACTT 
ATACTAGGGATGAAACTCTNCTACTATATAAAGAGAAAGGTTTTCTTTTGNAACATATACTG 
GAACATGCAAATCAAAGCAATAGGAGTTTATTTTCTGCC 

SEQIDN064 7 

GCNNTGGCNNATCCCACTNTATGGGCGGTAGCCAGGCGTATACCGAGGTCGGACAGATCACT 
TAGCGCTGNCGGGGGAAAAGGGCTTTGCATAACCCTNGCAGGACTCGTTTGNTTTACNCGCN 
TGNAGTNAGGACCTTTGNTGCGAGGNAGCCCGTAAAGCCGAGCAGCAAAGNCATATTCCTGA 
GCTGGTNAAATATTTCNGNCNGACNGGCCACGTNCC 

SEQIDN0648 

CCNGGAACCTATTGACTCGACCTCAATCAAAGAAAAGGGATGGTGATTTCGCTCCATTTCCA 
GGCTGNTTCCTGGTGTTCAAAGGGTACTTTTGAGTGGCGTTTCAGGNGGNCTTTTTTAGCAA 
CGACACAACTATTTCGAACAGAGGTTTCAGCTGCGNTTCGAACAGTTTTGAGAGNGATTTCT 
GGNGGTTTNCGGGGCTAGAAGGATGCTGGTAGAGTTCTTGTCCGAGGTTTTGACATTTCAGA 

TTCATCGAGGTCTATTTCTTCCTTCCTCACGTTGTTTGTGC 
SEQIDN0649 
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CNCATCACCAAAGCTATCATCTATGAGCTGGNGGAAGGATGGAAGCACTCCATGTTTCACAC 
TGATTGAATCACCCGTCCTACCAAAGCATTGATGTCTTCTTCTTTATGATCACAGGCACCCT 
ATT AC 

SEQIDNO650 

TCGTCTACGGANGATTGNTCAGGTACACGCTTCTGAAATTATGGATTGATGTACGTTTGAAT 
TGGAAGTTGAGTTGAAGTAAACAAAGNAAATGAATCGTTCACCTACTTTCACAATACCTGTG 
T T TC AAAT GT AGC AAT AG G A 

SEQIDN0651; 

CTACGGNNAACTCCTCATCTTNNCCCTTCTACTCCTTTGATGTCCAGAGCAACATTTTCCGG 
TGCCGGAATTGTGAAAGGGAGGTCAGCGCGAGCAGAATCACCAGCCATTGTGGCAATTTGGC 
ATAGTAAAAAGACAATGGAAAGGAAGGATGAAAGTTTTCGA 



SEQIDN0652 

CGAATGTCCTGATTGCACTGAAATGAAATGAAGAGGAAGCATATTTTTGTTGAAATTTCCGG 
TGGCTTCAATGCTNTCATTATAGNTTTGNAATAATTTTGGACTGNATTGAACTGATGAACTG 
TTAGGCTTGAGTTTGATCATTTGGACTA 

SEQIDN0653 

CTGGTGTCGAACAATGTTCAGNGTTTTCTTCTAATGGTTAGTTCAAGTTGTTGTGGATAAAT 
GATTATACTGTGCTNTTCNTAAACATAGGATGCATTTGTACCAT 

SEQIDN0654 

TTCTNCGGCAGAAGTCAAGCTATCTATCAAGTGCACTTGACCATGATAAGGCGACAATCCCG 
GAGGGTAACTCTAGAGGAGGTACATGCTCGCGGCTTTGATCTCTCAGCCGATATTGAAAGGA 
CGAAGATTTTGGAAGAAGAGGCTGCCACTCAGCTTTCTGATGAGGATGATTCAGCCAGTGGC 
TCTAAGAGTGGAGGAGACGAAGATGAAGTCCCCGAGGGTGAGGCTCTCGAAGATGCGGCTCC 
TAAAGATGAAACTGCTGAAAATATGACCCCGAAGTAGTTTTGGGTTTCCTTATTTTGTTTCT 
GTTCAAGTCTCCCTTATGTAAATATCTCCTA 

SEQIDN0655 

GNNTGCTCTTGATTTTTCTGAAAAATCAGAAGAATCATCAGTGTGTTCCTCTGTGGTGTCAT 
ACCAAGGAGGTGAGGCTGAAAGTAAAGAGAATGACGACAATTCATCTATATGGTCAATTCAA 
GTGAATGCAAGTACTAAAGATGATGAAGAAGATGAGGAAGAAGGAGGACTTGAAGAAGAAGA 
AGAAGAATATGATGATGATAACTATGATGAAAATGAAGAAGATGGAGATTTAGTTGATGAAC 
TGTGTGAAGCAATTAGCAAGA 

SEQIDN0656 

ATGAGGTGTTGGGTTACATCTCTATTTCCCTTTTTGTACCNTCCACGTGGACACTTCTTCTC 
CTTTAGTTTTGATTCTTTGTCTGCAATGCCCCTCTTTCCAACCTCTCAAATGCCTGGACAAC 
AGATAATCTCGTTCTTGTTTGNTGCGACAAATGTTGTTCATAAGTTGTGTTTATTATAAGAT 
ATTGAACATCATAGCTTCCACTTAGTTCTTTAGCTAATGTGAAAGTTGCTTATGG 

SEQIDN0657 

ACGTTGAGAGCCGTAAGCCAGAAACTGGAGAGGAAGATACAAATGCATCTGCCGGTTCAACT 
GGAGTTGATAGCATGGCTGATAGCATAAAATCATTCACTTGTAATCAGAATTTTACAGATAC 
TGAGGCTTGCACGTCAGCAATAGGTCTATCAGCTCATGATGATCAGGCATCAGATATTGCAG 
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ACCCTGAAGAAGCTGCTGTGACAGAATCAGCTGTAGTAAGTCAGGAATGTGCCTCTAATTTG 
G 

SEQIDN0658 

GGATGAGAGAAAGCCAAGTCGGACGGTTTGGTGAANCCAGAACTAATTCAGCAGATCGTTAT 
AGTGGACAGAGAAGCTGATTTTGAAAATGCTCTTCAGAATGGTGGGGGGAAGATAGCTCCTG 
GTGGTGTAATCAGTGTAAAATCCAACAAATTCAAGCTTGAGAAGCATTTNGAGCCGGNGACT 
GAAAAGAGTGGCNAGAAAAAGAAACAAAAAACCATTCTGGA 

SEQIDN0659 

AGCCTGNCCTAAACCAGTNTTGGATCTNTGCTCTGCTGCCATTTGTNGAACCATTGGCACAG 
TGGAACTGAAAAGAAGAACGCGTCCATGCTGTCCTTGTCCAATCACTGTCCA 



SEQIDNO660 

AAAGCAACTGTTTNTTAGAGTNCATGGGTTTAGCCATGGCCCATNCTTNATTAGNCCNAAAC 
ACTCCCNAAGATATNGATATTGGNCACAACAAAGGCCCGTGCAGAAGATGGTGTGCCACTCC 

CACCA 

SEQIDNO66I 

ACGGGGNNNTTGTCCCATTGACGTATCTCACAACTATTTTAANNGNCAAACCCGAAGTGGTA 
TGiGGTGTGGTCTGCAAATATGAACNCTCACATTCTTCCCGNGGTGCGTAGTTAGCTACAAA 
TATGGACGTCATATGTCAGGTCAAGCAAATNGTGCTTCATCCATGAAGTGGGCTCCTCATGC 
TTCAAATGCAATGGGNACA 

SEQIDN0662 

TTGAGAAAGTTTTGTTTTTAAGACNGGTTGCTNGGAAAGNATGGNNGTTGGCCA 
SEQIDN0663 

TTNNAATAGCCATACAAGGTATATCGGNGGTTANTGCATGTTTTTNAACTTATGGNNCACNC 
ANNATTGTTGTTGATCCANGGTCACAAANAGNCAAGCNGTCANGNTGNANGAGANAANTNAA 
NAATGGAGGCANATGTGGNGATGTANNTACCAGTTGTGAACAATANGACATGNACT'GTTCGN 
CATGATTGGCACNATTTGTGNGGNGAATCCNAAGCAA 

SEQIDN0664 

GANGACCCTATGCTGATGATCCCTATGCGTTTGGCTAGAGGTGAAGATGTCCCACTCCAGTG 
CAGAGCTTCCTAGAGAATCTGAAACTTTGACCTGGAAATGTGTGTGCGCTGATTCTTTGATT 
GCAGACGTATAGCTGGCTGCTTTCCACATTGCAAGGAACTAGAATTTTACTTCCCCCAAAAA 
TAAAACTGTATATAACTGCAA 

SEQIDN0665 

eCCTATGCGNTTGGCTAGAGGTGAAGAATGTCCCACTCCANGGCAAAGCTNNCTAGAGAATC 
TGAAACTTTGACCTGGAAATGTGTGTGCGCTCNACTTTGATTGCNNTACGTATAGCTGGCTG 
CTTTCCACATNGNNAGGAACTAGAATTTTACTTCCCCCAAAAATAAAACTGNATATAACTGN 
NATTACTCAGGACTCATNATCCTCCTGCTCAAGTTGCTCAAGTTCCTGGAGCAGAAGTGATC 
CCTGCTCCAGCTCCTACTGGCTGGGAATGAGACCTGCTTCCTTTAGAAAGTTCTTTTTGA 

SEQIDN0666 
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GANNGNCGTANACGAAGNCAGGGGACTGAATCATNAAGTATGCACAACGGAGCTCTATTTGT 
TNGTTCCACCNTGTGTTGGGNGGGNGGAGTGGCTNCCTANTGATATGTATGTATNNTNNGAG 
CCAAAGNTCATATTATACTTAANCCTACTGNGCNCCTATAAAGAGAATGCCGCGAGATTCAG 
AAGATGCTTCTGATCTGTGA 

SEQIDN0667 

GGAGGCTAATAAGTTGAAGGCATTGCAGAGAGCTGCTGCTCGAACCTCTCATATCAAGTCTA 
CGTGATGGTTTTCACATAGAGCTCCATAGAGGTTTCTAACTAATTATATCCTTTCTTATTGT 
AAATGCTTCAGATTACCTTCAATCTTGAACGTCCAGAGACTTGTCCAAATGATAAATCTTTT 
TACTCTTTCACCCAAATTGGATGTCATTTTCA 

SEQIDNO668 

AATCTGAAGGGTCAGAAGAATCATCAGTGTGTTCCTCTGTGGTGACATACCAAGGAGGTGAG 
GCTGAAAGTAAAGAGAATGACGACAATTCATCTATGTGGTCGATTCAAGTGAATGCAAGTAC 
TAAAGATGATGAAGAAGATGAGGAAGAAGGAGGACTTGAAGAAGAAGAAGAAGAATATGGAG 
ATTTAGTTGATGAACTGTGTGAAGCAATTAGCAAGA 

SEQIDN0669 

GCCANCCCAGTCGACAAGACCAGCGCCTGNACGTAAAAATCTGATACCTGACTAAGCTTATG 
TCCTGAGGGAGCCAACCTCCCTCAGGCGTCTGTTACTACCTGCTGGCTT 

SEQIDNO670 

GCCGGCTCTGNGTCCACCTGACTATCAGAAGCGGCNCAGATGATTGCATCTGTATTANAAAC 
AANGGAATCTCCATCTTCCATGANTGNGCCTATAGACATCTCTCTATAANTCATTTTTTTTN 
CTTNNNCANAAATNGNCGGAGATACTNTAGCTTCATNANTNGT 

SEQIDN0671 

GGGCAAGTGGATGGTGGGTACTGNCNCGTTCGGAGCTCGAAGGTTTCTGNNNCTGGATTGNC 
TGTCTATACCATTATGTGATGTNACCNAGATGGCATCGCATCTTGAGGCCCACTCTCATCTN 

GCTTNTG 
SEQIDN0672 

GGNGCNATTGCCNAANTGTGCTTCTTGCTGGATATCATGTGTGAGTGTTATCTTCAAGAACC 
TCACAAATTTGTAGTTGATCAGAATCTTTGCAATGCGTTTTCTCATTTTCTTTCATTTGTGC 
TTCCTTTATTTTGTCTTTTACG 

SEQIDN0673 

GGTGCTGAATTGGAGGAAGGAGAANAGGANNNGGANGAGGAATGCCTAGNNGNNNGNGTGCA 
TAGANTCCAACTGAGTCACGCAAGAAACCAGTNTGTTCCACTGNTTGGCTTNCTGCTAGGGN 
TGTTGAGTCTTTGAATAAGAACGTTGATGGN 

SEQIDN067 4 

CACCATTCTTGATCGTAGTCCGAGATTCCACGGTGAGCTGCTCCCTTCCTATGTCGTTCAGC 
AGCATGATGGAGTCTCTCTTTGCTTTTGGTTGTCTATTCTATTTCAGACAGTTGGATAGATT 
TATTCTTTTATATATTCTGCTAGATGCCCATATACTTGTGACACCAGGTCTTGACACACACA 
TTAGTAGACTATTCTTTTGGGATTGTATAATTATTATTGTACGTTGCTAATTATCACTTGGT 

SEQIDN0675 

GGGGGNNGNTTNTCTCTCCGCTGGAAANNTGANTGACTTGGGTGCTAANTGATGGNAGACCN 
ACACACCCAANAAGGGNAAGNGGAAAGGACGACATGGNTCAATAGCNCAGNGAGGGAGACAG 
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ACGGAATGAAACGANNCAAGANANTGGGGNNACCNTGTTCTATTTANTGTGNNAGNNNAAAC 
AACCCACGTTCCTNACAAAACAAACAGTATTTTGGATCGGAGACTAATCTGAATTTTCCAGA 
CGAGTTTTTTNCGGTNAATCTNGAGGTTCCGACATGGNTTTTTG 

SEQIDN067 6 

TAGGGAANCNATNCTCATTTGTTATGACCACCATTTACTTAGCT 
SEQIDN0677 

CTCNGNTANCAACACGGCTGGATAAACTTCAGNGCTCCCGGTGTGGGTCTATTTATCGGAGT 
TTGAGCACGACNNACACCCCGGGACCATNTAGNTAGGATNGCTCATTCANGAATAGC 



SEQIDN0678 

TCAGAATGCGAATTTGCCTACTCAAATGAACGAGATTCCTGCTAAGTGGAATGGCAATCCGG 
AAGGTTGTAGTTTTGTTCGTCCAAGCTCTTTCTCGGCTTCCTCATCACCTGCAGGTCCTTTT 
AGATCATCATCTTTGTATTATTCTGCCGGCTTTTCATAACCAAGAATGTTGCCTTGCATGGG 
CATTTACTCTCATGACAGACAATAGAAACCTGACGCTTACAAAGCATAAATATAGCAGTCTG 
AACGAAAACACACACGGCAAGTTTGAGCAGATGAGTTATTCTAGATTTGCAGGTTTTGCT 

SEQIDN067 9 

TTTGGCCATACAAAGGGNTGAATATGAGGNATATGGGGGGNTAGGCATATGTCGCACAAACC 
CTGGNAT 

SEQIDNO680 

ACCCTACCGGGAGGATCATATGAGCGTGGGTTCTACTGGCCTCGACGTCCTCTGTAGTTGGA 
AGGGAAACCAT 

SEQIDN0681 
GGTGTTTTAGGTTGTCT 

SEQIDN0682 

TNGCCCCNGCCAGTCGGACAGAANCGGNTAGNACCGAAGNCNATNCTGCCACGGGCANGGAA 
GACGT 

SEQIDN0683 

AGAGGTGGTGGGACTGTTCGTTCGGTGCTCGAGGTTTCTGGTTCTGATTTCTGTCTATACCA 
TTATTGTTGTAACCGAGATGGCATCGCATCTTGAGGTCCACTCTCATCTTGCTTATG 

SEQIDN0684 

GCNGNGNNCAAGGNGGCTACCTGACNTNACTNAATAAATCAANCTNTTTGAACTCAGGGTNT 
ATAGGANGAGATGGAGGCTCATGCATGGTTGACACCAGGGTTACTGGAAAGANGGTTTATCA 
TCCAAACCATAACATTGACACTGAGGATGATGCACTTGCGCTGAAGTTGTCATCAACCACAA 
CCATTGCTTCAGATAATACGAGCTCATTATCTAATGAGGAATCAGCAAACTTAGCAAGTGTT 

ACTTCACTTTCTG 
SEQIDN0685 
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CGTTACATATTAGGAGTATAATTTTTTCATTACTAAAGCATGTAAATATGTTGCTCCGGGCT 
TTGGTCTATTAGTAAGAGCGCAATGCGTGATATGTGGG 

SEQIDNO686 

TCNAGCAATTANNNNNTTTGGCCTGCNGGTNCCTNTGGCGCTGANGATCTCTATGCCCCGCC 
GGCAGACGGTGGATTGGATGATGACAATGCTCACG 

SEQIDN0687 

TGGNNNNTCCTNNCNNGCCAATAACCAGCCCCNGGNGCTATCANCATAANCTAAAAAGANCC 
CCATACANTCAACCTGGCTGGNCCATCACTTAGGGCNNNGTTTCAAGATTATCCAACTTGGG 
NAATACTTATCCGCCANGATCNATAGCCGGATCAGACNGACG 



SEQIDNO688 

AAGACAGGGATGGCAGTGCTGAGAGGAGGGCAAAGATTGAGCAATGGAATAGGGAAAAAGAA 
GAGGNAGAATCTGCTAAATACAATAATTTTGACACTGATAATGGCAAGAGTGATGGTGGTGA 
TCACTATGGAGAACAGTTTGATGACGATTACCCGAAGCAGCAGTAGGTAGCAAATGGGAAGT 

TATGGGCTACTGATAGTAGTGGTTACTCTGG 
SEQIDN0689 

NAANCCCAGNANNATTCNNGANGCAAGGGTTGATAGCGACTATCANGGCTGATGATTTTTCA 
CCGNGCTTNGGCGGGAGTAGCCTGTGCTCATTGACNGGAACCCGTNTCGCAGGACCTTCGCC 
ATGAATCGNTTTCTCGCCATTTCCGTATTGCTCGTCANCTCAGTCCTTGCCGGTTGCGCGAC 
ACATTCGNCGCCTGAACTGCGTGCCTACTCGGCGGAAGAGAGCAAGGAGCTGGCGCTGGAAG 
CCCTGAGCCGTCGAGGCCTGTCGTTTGATGAATACCAACAGAAGAAAGCCGAACTGACCGGC 
CAGCCACAAAAAACCTTTGGTTTCGACCGCAGGGTGAAATGAATGNCGAGCGCGGNATGACG 
CTCCACGGCGCCCAGGTGAGTTAAGTGACAGGGCNTGAAAAGCCGAGGGTTCCACANGAACC 
TCGGGTTTTTGNTTTGCCATCCCGTTTCCGGAGCCTG 

SEQIDNO690 

CCATNANTTNACANTGCTGGNNCATNNACAACCCGGTGCGCGGTTCGCCGTTGCGCGGCAGT 
TCCGGC 

SEQIDN0691 

CGGTACGAGAAGCGTGTGATTCAAAAACAAGTGTGATCATGCAAAGTATTGAGATGGAATCT 
TGGAATGCATGGAACTAGCGTTAGATTTGGTTGAAATTTGTAATTCTAATCGCAAGC 

SEQIDN0692 

GGGCCTCCTAGCAACATTTAGGAACCGAATAACAGCACTTCTCAGTCTATACGGCATCCTGA 
TTTGTTCATCAGCTCGTATTTCACAGGCTACCATATCACCAGTGTTCCATGCTCAGCC 

SEQIDN0693 

CCTCNGGCAGGTACTCAATAGCNAACAACTTTTACATCCTCAAATTAGCACAAATCTACATA 
TTTCATATACAGAACACTATAGTAGAGTTCATGTTTAGACTATTGCCAAGTCTGCATGATCT 

AAACAACAACTTCCACC 
SEQIDN0694 
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TTTCTTGTTGCTCGTGAAGAGCCAATAACCAGCCCCCGCAGCTATCAACATAATCAAAAAAG 
AACCACCATAGATCAACCTGGTAGGCCCATCACTTAGGGCAACTTTCAGATTATCCAACTTC 
GAAATCTTATCCGCCATAATCAATAGCCGCTCAGACTGACG 

SEQIDN0695 

TGGTCGNTGNAANAATTTTGCTGGAAGCTTTGTNNAATGAAAAATTGNTGCTTCAG 
SEQIDN0696 

GCCAGCTAAGTGGCTTTATAACACCAAAAGAAAGAGGCCTTAGGACAACTAAATATGACATA 
CACTTAGACAACATGAATTTGCCAATTTATCTGTTACTATTTCCATTGACCTCTAAACTCAC 

CTCCATGCA 
SEQIDN0697 

NCGTAGCTATCTTTGCTGCTTCTTTGATGCTTTGAATCATCTTTGATCTGTGACGATATTTT 
GTGTTTTATTTCGCCGGAGTTGAACAGTTAGGAGTTTATTTATGGNTTTATTTTTCACTGTT 

TTTTGTTCATTCTTTTTTTTACTTCTTGACA 
SEQIDN0698 

GATGCATGTGTCACAGAAGAGATGCCATAGTTCCATATTAGGAATTGATAAGATGTGCTAAG 
ATCAATATAGGTCACTTAGTATTATTCTCTTCTAGGCACTAGTTTCAGGTCATATTTTAGTT 
TTATGGGATGCATTTCGTAAACTTGTTCTTGCCTTTCAGTTTCATTTTGTATGTATATGTCA 
CTGGTCCATATTGTTGTTGACACTCGGCA 

SEQIDN0699 

GATGGACGTGTTATTGGTGGTGGAGTTGCCGGNCTATTGGTAGGCTGNCAGTCCTGTGCAGA 
TTGTTGNGGGCAGCTTNCTTGATGGAATTCAGCTCGAGCAGACGACCAAGANAAACAAGTNC 
GAGCCCATAGNTGNAGCTGNTCCTCTATCTAGTACAGATATGGAAANCGCCTATNACTCATN 
ATNAGCANAACCAACTGTAGNATCGGATTCTTCCTTACATGAAGATAACTGNNCATNATTAG 
CCCNCGACTTGAGGAATANNCCTGCTGACATCAATGNATNTAANCCTGCATAGGTTTTTGTN 

GAANTGNANTTNATCNG 
SEQIDNO700 

NCTGAAGAAGGTCCTNTCGGGANGAAATAGCTAGGNNGTCTTNGNTTCANCT 
SEQIDNO701 

NCTGATTGTTCTTACAAATAGGTCAATCTTAGTCCAAGTAAGTATATTCTCTTACTTCTGTA 
NTTTTCCAGATTTGGT 

SEQIDNO702 

TAAGGGATCACGACCCTACGGGAGATCATATGAGCGTGGGTTCTACTGNCTCGACGGGCTNT 
GNAGNNGGANGGAAACCAT 

SEQIDNO7 03 

GAANGAAGGTTCCAAGNGNCTCCCATTGTGGAGCANTATCACCTACACATTGTAGGGCTAAT 
TATCTTTTCACTTCACNCGGTAGAGGANCAGATTGCATAGCTGT 

c 

SEQIDNO704 

NNTGATGTCCCTTCCTTTCTGGTGTCGTATCCGGCTTTTTNCGTGGAAGCGGTGTTGCTAAA 
TCGNGTGTCCGACGGCCCTTTACTGTACTGGAACGACATTTCTCATTTTGTTGCTGCTGTTT 

ACGGT 
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SEQIDNO7 05 

GATATCTTCATCTTTGCGCTTTATGTTCTTCACATCCACT^lGTATTGGTGTGTTTTCTGCAT 
TATCATTTCTCAGTAGTTTCCTTCTCTGTTTCTCC 

SEQIDNO7 06 

CGAAGAAGAAGANACTTACG 
SEQIDNO7 07 

CACGGAAATCACGCCGNCNTTGGTACCTTGACCGGGTTNCCTANAGGGNACTTCAGTCANTG 
GGNNGCCCAGNNACTGAGGNGGCCG 

SEQIDNO7 08 

CATCTGGATTNAACAATTTCATGGCCAGGTTTTCAAAAAAATAAAACAAGGTCTTCATGGCC 
GTGC 

SEQIDNO7 0 9 

NAACCTTACTGTACAAAGGAAATCATTGGTTGCTTGGGATAAAGTCTGCATGCCCAAAACTG 
AGGTGGCC 

SEQIDNO710 

CGGNNTTTTGACAAAGGTTCCCGCTTACACACTCCTCGTNCGATGNGCTCCCTGACCCGAGT 
GTTNTCGCGCAGCAGTGTCATGNTCAAAACCAGGATTGNNTTNAAAANGACAGGACTTCAGG 

TCATTNATTCCGCC 
SEQIDN0711 

ATTNTNNTTTTTGGAATGGTAAATACAGGTTGGATAGAAGCTTTCCCA 
SEQIDN0712 

GCAAATANTTATANGAAAAGGTCAAGGAAACACTAAGTGTGTCATAAATAGGATTATCTATT 
ANT A 

SEQIDN0713 

CTNNCTTTGNTNGACGAGAGTAANANCTTGGCAGCTATCTTCCAAGCCATTTTCAAGGGCTN 
TGCATCTGTAGTNCTNTGCA 

SEQIDN0714 

CTGATAGTCTGATGGGCTTCCCTTTGAGGGTAACCCGACCTTTCTCTCTGGCTGCCC 



SEQIDN0715 

GANCCAGCGGNANTAGCTGCTGTACTANNNACAGGNATCCAANATATGAAAGCT 
SEQIDN0716 

AGCACNTCCGGCTGTATCTTACTACCAGAGAAATTACAGNTGTGGACATATCTCGAAGATGA 
ATCAATNGAATATATCTNCTAATGAAATTGTCTTGCTCTTTNGTTGNGTAT 

SEQIDN0717 

ACTGGTCCAAAAGCTNCAAAAATTTGTCTAAGCTGTACTNTGNCATGNNGAAATGNAGATTT 
CCNACATAAAGTTTTCTCTCTGAGGCAGCATTTGNGCCTGCCAACCCTGANNCACCACCANA 
CGCAGTTGACTGAACAAGGGTTTTTTCAAGCTTCAGAANGCNTTACCAATNNTGGGTNGNCC 
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AAAAANCAAGGCANCGGGTAAANGAATTGGCCATNGGNCCAANCTTNGNNTATAAAANNNNA 
ANGTCCCNAANTCNTTTANATNGCNTNGAATNCCGGCCNNTGA 

SEQIDN0718 

GATTTAGTGATNAATTTCCAGCTTATTTTTTGNTGTGAGAGGAGNGCAGTATCAGNACTCCT 
TCTGGCGCCAGGATACCATNAACAGGTAGCCATCGAAGGTGTACA 

SEQIDN0719 

GGGAAGNTCCAAACAAAAAAGAAAAACGCAGTAATACCCTCCAAAAAGCTTCATCTTTCTCA 
CCAAAGCCTCTTTGCTTTGGCCATAGAAACCAGTAACCATTAGCTATGTAAAACCATTGCAG 
CTACCATTTTAGAAACAGTTTCGAAACGCCA 



SEQIDNO720 

AGCCCTTNGTCAGCCCACCTNTTATGCTCAATCNCACCGNNAGGAANNCTGNNAGAGTTANN 
GANGCGATTGATTNCNGCNCTGACAGATCATATNGCTTCTATAANNGTTGNGCGGACACGCG 
AATNAGNTTNCTTACCCTCGCATAAGACANATNCTGATCTTACCAACCACTCATTAGATGTG 
GNACCTACAGCANCTACATCTTCTACTGCTGCTAACA 

SEQIDN0721 

AGGCATTNTNCAGNGCGCCACAAGTATACTGGATTTCCCGGAGACATGTGACTGGAGANGCA 
TCACCGCAAGATTTGTCCGCTCAAACTCTCATTGATGCTGCCATTGNCATACAAAAGTGNAT 
TCAGCGGGTGGATAGTAAGGTCTTCTCTTGTAAGCACGGACCAATCTCCTACAGTTCCTAAG 
GAATGTGAAGAGAATATAAATGCAGCGGNAGCAATCCAACATGCTTCAAAGGAATATACA 

SEQIDN0722 

GGAAACATACAACGAATGCCAAAATCTGCCATTTTGA 
SEQIDN0723 

CTTTCCGAATGCTACCNGATNGTATCAATTGGGGTGAACTGGTTGGGGTTTTTTTCCCCCTT 
TACC 

SEQIDN07 2 4 

TCNTTGCNGANATCTACCTTACATGTTCCTGATGCAATCATGACTTACTCTGATTTACACAT 
GGGTTGCTGNGGGCTGATTCCATGTCC 

SEQIDN0725 

GCCTTAAATGGTGTGTTCTAACAGGCTTATGGGTATGCTGGCATTCTCCATTGCTGGGCATA 
CCCACAGCCTGCCCTTGCCTCTTTCAATTTTCCTATTCCCCC 

SEQIDN072 6 

GCGNTACTTCANAGTCNNGGANAGAGGCTAAGAGGNCNNACANNAANTGCTTCAGTACTAAT 
GAANCANATNCTNGNNTCTTTTTNAGGGACATANCAGGTTTTGACAAGCCCCCACATGAATA 
AGAATATATNANACTTCTCTAACC 

SEQIDN0727 • 
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TTACTGTTTGTCCTACCTGGTGATGGATAGTTTGGGTTCTGAATAATTTGTGGGATGCAACA 
ACAAGCTTTTGGTTACTTTTTGTNAAGTACAGTGGTTACTTGAACTAGTTGTGTAATATATG 

CTATGGTAGTGGTCGTATCTCGAAACACGTGATATTTAGTGC 
SEQIDN0728 

GCCTTGGTAAGACATTCGTGAAAAAACTCTGTTATTTCTTAGAGATAAGGTGGTTCCCGC 
SEQIDN0729 

GACTCTTGCCAAAATTGTATCTAAATCCTCATCTTCCTTTGGGATTGGCCAAGATTGGCTGG 
CAATGTTGGGGCATTTTTGTTCGAGTTGTTCATGTTGGACAAGTGACTACCTGATTTAGATG 
TTGCAGAGCAAAGCTGTGCGATTGTGTTGTATGTTATTCTTCTA 



SEQIDNO7 30 

GCCCGACGCTTGAAAGATTGCATTTGGAGAAATGCCAATTGAGAAATAAGGAGAGCTTGAGA 
ACATTGTTTCTACTCTGTCAAGACGTCAGAGAGGTTATTTTCCAGAACTGTTGGGGACTGGA 
TAATGAAATGTTCAGCCTTGCCAGNGTTCTAAGGAGAGTGAAGTCCCTTTGCCTGGAAAGCT 
GTTCACTACTCACAACTGAAGNCCTTGAGTCTGTCCTCCTTTCATGGAAGGAAATCCAGAGC 
CTCAAGGTGATTTCATGTGGCAATATAAAGGATAGTGAAATCAGTCTAGCACTGTCTACCTT 
GTTCTCCGCACTAAAAGATTTACAATGGAGACCAGACTCAAAATCTCTTCTTTCAGCTGGTG 
TTGNGGGAACTTGCATGCGGAAAAGAGGCAATNAAATTTTTCAAAGAAGACCGTGNGACTTG 
GAAGTCACTTGCCTGGGNGCATAGACTGGCCTTCCTCATCATGCATTCAGGACAACTCTACT 

ATAT 

SEQIDN0731 

TAGGTTTTGTTTAGTGTTTTCTAAGTTCTTGTTTT 
SEQIDN07 32 

GAAGCNACTAGTTCAAAATATGTGCAGTGTTGATCATATTCTTTTGTTATGGCCAGTTTTTA 
CCATTTGTTGGACACGTTTGATGCTGT 

SEQIDN07 33 

GCCCGTANNANGGGTTCCNACACCNNCNTANGGTCCTNTTTTCCTTCTGAATNGAGCCTGCG 
ATAAACTCCATANANAACTAGCAAAAAGAGCTCCATTTTTTCACTAAAAACAACCGTTCAAA 
CAGCTATGANAATCCCTCTATCTCCATCAAAACCGCAGCATCCATCATCCTCAATAAAGGGC 
TGCACAAACCTGCTACAATCAGCATAAAAACAGCCCTGAAACTAGCTTCTTTCGAGCTAAAA 

TCAACT 
SEQIDN07 34 

TGTGCTAANGTAGCCCGNTCTTATCAATAAGTGCAAAGTTTGG 
SEQIDN0735 

TAATAAAGCCCCGGGANAAGNNAAGAAAAAAAAGAGAAAAAGAAACTAGGCCGGGTCAAGGC 
AGGCCATATTGNNAGCACTACTGCCTGG 

SEQIDN07 3 6 
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TGNGACCTTTTGAATCTCCCGAGTCTGNAGGTCTAGTTTACTCCCAATAGACGAGTATCACT 
ACAAGTCTACTGCAAATGGTTGATGTTTGATGTGGGAGACGAAACGATAAGCAATTTAGTAA 
CATGTGTCCTTTTTCACGTATATATAGATAGAGCAAGAATGAAAATGGAGACACCTTTTCCA 
TTTTTGAAGGATATATTGCTGTTTCTTCCCTCAAAGAGAGTTTTGTGCACTATGTTTGGTAG 
CTTTTCGAGAGTAGTATGTTTTTATCTCGTTGAAGCAACCTCCTTTTTTCCCCCTTGACTAG 

TTGACTTGAAGGG 
SEQIDN07 37 

TGTTTGTTTATGACCCTGCTGGGTCATTGGTAATTATGTGTTTAGTACTATGTCTTGGTGC 
SEQIDN07 38 

CTTTCGTCGGAGCTTTNGCCGCCGCCGGCTACCATCAGTACAATCCTCCGNGCTGGGCGCCC 
SEQIDN07 39 

TGAATCACCTTTTAAAACCACGGGNAAAAGTAAAAGTAAAAAAAAGNAGGAAAAAGGAAACT 
AGGCCGGGTCAAGGGCAGGCCATATTGACAGCACTACTGCCTCG 

SEQIDNO7 40 

TGCGAGACATTGCAACTAAGCAAGCTCTTTCCCTACATTGNCGTATCCCAGCACACAGATAT 
CACGGGGCATGGAGCCATCCNNCAGTGTCAACCAGTGCGCTATATAGGCGGNGACATGCGGC 

GCG 

SEQIDN07 41 

GNNNGNGGAGNAAGCAAGCATAGAAGGAGCAAANTGTTCATTCACTGTGAGTANGAAGACAA 
AGCAAGAAATAATTCAGAAGCTGATTGAAATAGTAAATGAAATATCAAGCA 

SEQIDN07 42 

CCCTGCCTGGGAAATGGTCAATTTGAGGAAGGGCATTGGCAGCTAACTTGTTATATGCGCAA 
AGTCTTGTATGACATAGAAGTAGATGGCAACAGACAACAAGTTCCACCAGATGATTCCAAGG 

TT C AAACT C AAGAAT C AAGG T GT T TT T AT G AAG GAAAT C AACAT AAT G AC AAT GAT G GCT AC 
TGGGACTATAACTTCTTATTTGGAGGTGCAGGTGGAGGAGAACATAG 

SEQIDN07 4 3 

CCTCAGTCTGAAAATTCCAACACCAATATGCCCCAATTTGATTCTAGCTTGACCCGTAACAA 
TATTGGATCAACCCCATATTATGGAAGTCATGAAAACATGACATCAACTAATTACCATATGG 
NGANTTATCATAATATGGTGCTTCCCAAGGAAAATATGTCAAATTTTGAAGAGGGTTCTTGT 
TCAATAGATTCTTATGACATGCAAACAGATCATCACAACAGTCGATGGACATTTCAAGATGA 
TGGAGATGACCTTCAGTCAGTGGCTTTCAGATATCTTCAACATTCTTGATCAGTANNTAGGN 

CTTCAAAAACAAATCATGGGTGAAGA 
SEQIDN07 4 4 

TNCTNTCATGGTGNGCCTACATTCNGGACACNGTANTGATCCTNGCCAGCANGATTGTCTTA 
CGCTACTACANTTGGANCGATNNGCCTTACCTGNCGGTTTTANTNNGAGGACAATAAGNTCG 
ACCNTCCNATCTGCCTGAGCATTNNNNCTATGATGANCGATNGGGAGGNCATTGTGCCATCT 
GCGAGTTGAANGATTATCCACAGTGAGAGCCGGAAACCCCTGCAATNCNANANTCTGGGT 

SEQIDN0745 

CCTTNNGTGGCTNGNGNTGTGCTCTGCGT 
SEQIDN07 4 6 
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GNTTGTCCTACCTGGTGATGGATAGTTTGGGTTCTGAATAATTTGTGGGATGCAACAACAAG 
CTTTTGGTTACTTTTTGTCAAGTACAGTGGTTACTTGAACTAGTTGTGTAATATATGCTATG 

GTAGTGGTCGTATCTCGAAACACGTGATATTTAGTGC 

NCTTTGAATTTGAACCACTACCTAATATGAAAGAATGCCTGCTCGTAATGAAATACTTGTCA 
TGGTGTCTCTACCGAGTCCTTTGGCTAGGGCAACTCAATCAATATGCAGTCGTAAGAATGTT 
TTGAAATGCATATGTAGTCATCATCGGTGTTTTCACATTTATGTGAATTTGGATGTTCG 

CCTGCTTGAGGTCCATTCTTTTTTCTCCTTTNTTTTAGTTCGATAACACTATATGCGGGTCT 
CTGATGGTTGTCGCGTNTTTTTGGGTGC 

SEOI DN07 4 9 

TTTGGAATACAATTCAACTTCTGTTTCCTAAAGAAATAGAAGCAAGAAAAGCAGCTGGAGCT 
TTGAATAGTAGAGAAGCTCGACGCAAAAGTCCAGTAAGAGCTGCTACAGCTCATTCTAACAT 

C T C T AG C AGC AG AAT AT C AAGAGT GTTCGCGC 

CAGTATCCCCCTTACTTGTGTCAAATCANCTTNTCCCAGTATGGCTTCCATATTTTGACTAC 
AATTCTTATCAGAAGGCATGATAGTAATAAGTGACAAAGATGCAAAAAACATAAAAGTTGTC 
CTTCACTTTTGGTTAGAGGCTGAAGATGAACTTTCTAAGTTGGACA 

SEQIDN0751 

TTCGATCGGTGAAGCTTCTTTACCAAC 

ScAAAGNAATGCNGTNCCAAAATACATTGAAATAATTGGCAGCCGAATACTAAACTTGATC 
ATGT 

CCCGAATTTCGTCCGCCAAATTGTCGTGCATAGGAACAGAACGAGAGCCATCAATGCCGTAG 
GCGCCTTTCGCGTACCACATGACCCGAGAAAAAACACCGGAAAGGATTTCCGTGATTTGTTT 

CTCGGTGTAGCTACGCCGCA 

TTGGTNTTGNCACCTGCNAATGGCNNTACATGGAGCAGGGACGNNAATAAGTGGNACGAGTG 
ACCACATGAGGGAG 

CATCTCNTCCTCACTTCTTGAACTGTACGCCCACCCTTTTTCTTCTTGGNTNTGTTCTTANA 
AGTTTCTGGCACCTGCTTTTTGCTTCTATTATCATCAGCTTCTTCAGGA 

NACACCAATATGCCCCAATTTGATTCTAGCTTGACNTGTAACAATATTGGATCAACCCCATA 
TTATGGAAGTCATGAAAACATGACATCAACTAATTACCATATGGAGANTTATCATAATATGG 
TGCTTCCCAAGGAAAATATGTCAAATTTTGAAGAGGGTTCTTGTTCAATNGATTCTTATGAC 
ATGCAAACAGATCATCACAACAGTCGATGGACATTTCNAGATGATGGAGATGACCTTNAGTC 
AGTGNNTTTCAGATATCTTCAACNTTCTTGATCNGTNNCTATGNCTTNAAAAACATATCATG 

GNTGATGA 
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SEQIDN0757 mmm 
CTTTGGGGCCGTTCTTGGNATCCGTCGAACTAGGGTGTTGAAATTTCTNTTTTTTCTTCTTT 

ATTGGGTTCTATTATCGATTNCATGNGATATTTTATTTCCTTATTTGTGTTTGAGTAATNGT • 

TTTCCATGTTTGCTTGTTCGATTTCTACCACTATATAACCCCTCCCCAATTACCCTTTTGGA 

CAGACC 
SEQIDN0758 

GGTANCTCTNGGNCTGCGAANANGNCTCTNAGCCTTNCNCAAGCGNGCGCGAGAGAAGCGGC 
NNACNNAGCTACCGNTTCACCCGNCCGACTAAAANACAACAGNCGCAGACCTACTTTGATTC 
ANAAGAAAGGNGACGGNTTCGCNAACANGNANNCGGNTTTCTATCANAGGTGCNAGGGTTCC 

AAACC 
SEQIDN0759 

CCTNTGGNGTTCTGNNAATTCTTGTACACANAAGGGCAAAACAAACAAAGGAAGAGCAGCAA 
AGTATGAGTAGAGCTTCAGTAGTACTAGTAGCTATTATGGTNGTGGAA 

SEQIDNO7 60 

GNGGCATTCGGANCGATGGATTGGTCTTCATAACATTCATCATCTTTACATTGCAGCATTTC 
AGAAG 

SEQIDN07 61 

TCAAAANTANTNNCNTNCTNGNNCTGCACATTGAGCATGTGCTCANCAACCTNTNTTGTGCT 
CNNTNTTC C CCTG AACAT AGN AGT AT GCAG 

SEQIDN07 62 

TAGNNCCTGAGACNNAGNAAGAAGACAGACNGTCACTGCAACGCCNNANGNGAGCATGACNN 
GANCNGNGGNAC 

SEQIDN07 63 

GGCACAAGTNNAANNGCCTGTNTCGAAGGTGNGGCAACAACC 
SEQIDN07 64 

CAACGTAAAGGATTCAATTCTTGTTTTGTTTGTTCATCATTGAAATAATTTTTTTTTAGTCT 
TGCATTATATGTTTGGTTGGT 

SEQIDN07 65 

GGCTTGGNGGNNGCGGGTGNCCACCATGNNATGCATACANTATNCATGTANGNNGCTACANA 
GACACATTNGGAATAATGNGTCGGATCGNTTAGNNNTGGG 

SEQIDN07 66 

CNCGATTNNATACAACCCTGAGAAAAGAATGTTAAAAAATGACTATCTTTTGTAAAGAAACC 
CCTTTCATTTCCAGGCAATGCAAGGGGGATCACAGTTTTACATNGTGGGTGTGGTTATTTTA 

C GT C AC AG TT 
SEQIDN07 67 

ACGATCGATNANGTGGNCTNGNAACATTCANCATACTTTACATNGANATNTCANAGGTTACN 
CAGGNCTCATCANTGGNNNAGCCTNTGCTCANCG 

SEQIDN07 68 
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TCGCACACAGTATCATGAGAATNNTGGNCTTGTCATCCTCAAAAGAATCCTGTNANAGCATG 
G 

SEQIDN07 69 

NTCTACAATNGCATACANCATCAAGCATAGNCAATCACAAACATGTCATGTANAAGTCCTGA 
AATTTCGATGTCAGGACTAAGCTATAAGNACTACTACATGGAAAGCATATATGTGCATTCGT 

NGTCCAAGCAT 
SEQIDN0770 

GAGCCTGCTGGATCTTCTTTCTCTTAGCAAAGAGGAAAGGAAGAAACTAGTCGAAGAGCGCC 
CTGGAATCAATAATTCTACTATTACTGCTCTCATTTCTCTAAAATGGAAGGAATTGAGTGAA 
GAAGAAAAAC AAGT G T G G AACAAC AAAGC AGC TG AAG C AT AC AAAAAGG AAAT G GAAGAGT A 
C AACAAAT CT G TAG C AG AAAAGC AGAACAACAATT AG AAAT AGT AGAAAT AAC T AT AAT AT G 
TTCAACTGATTATGTTGAACATAGAATGATTGCTAGTTAGTTGAAGTAGTAAATAGGTATCA 

TTCCAATTTCCTTTGTTGTTTAGTAGCAG 



SEQIDN07 71 

TCCGNTGCAANCGGNNCTTNCACNCTTAGCAANAACACNNTNCTGGGGATTNNAGTCATGCC 
ACAANTAGCAGGGGCTNAGNCGNCC 

SEQIDN07 72 

GGTTCTNCTNTNNCTGCTGCGCCTGACAGCANTTGTGTGGNTCTGNCGCTGCACNCNNCNGC 
TGTNTACGCNGGAGGNGN^AANGGNTGNNCCTGNTNNGGAGTCACATGATGACANGNGTNAN 

ANNTNGTTNNA 
SEQIDN07 7 3 

ANGNGCTATATCTTCGNNAGAAANACTGCTGCGCAGTGTGNAANAGCGTGNNTTCACGGTAT 
GNANGGNNGATNNNACTNTGCAGNAACTNCNA 

SEQIDN07 7 4 

CTGTTGNTCTTTGGNCACATGATGATTCAGNTTGNNAAATNTGTGG 
SEQIDN07 7 5 

ATAGTAACGTGCCTCTTTGTTTCTGCNNTCAATTNGGCTANAGTCNAGTGGAGTAACGCGTG 
NGCCATTNTTNTNGAAGCTGTCGG 

SEQIDN07 7 6 

NTTTATGCCGGAANAAAGNNAGGCNAGNATGCAGATGCNGGNNACATAACGCTAATATGNGG 
ATGAATNAGGACNAGCAGCAGTGAAACTCCTTCCC 

SEQIDN0777 

NGAGTNAAGGGCCANTCTGAATNTGGCCTAATNTGGNTAAANNGNGGGGAGTANGCCGNACA 
NANTNATTCTTGTGGNTGGNNNNNCGTTNA 

SEQIDN077 8 

CTGATATGGGGATTNNGAGGCAAGGGGTATGGGGNATCATGAAGNTGGTTGCAG 
SEQIDN07 7 9 

GANNAGGCGCTCCCTCCTTNCTTTGTGATGACANCNATNGAANGAGAAGACTCCTA 
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SEQIDNO7 8 0 

GAAGCATAGCCCNGCGCNGNTNGCGTNAATGAGANCACAGATGGNNCTAAAANATGANTGNT 



SEQIDN07 81 Mr% 
CCGCCTANTGCCTGTTAAGTCTAGCAACCTCCTCNAGAGTTNGGG7^ATTCACAATGGCAGCC 

SEQIDN0782 

GTANGGCCGAGTNAANGGTAGCAGAACTTNGAATGTGGGACNNGAGNGTACAANGCGTCNGA 
CANNGACTTNGTGTANANNC 

SEQIDN07 8 3 

GGNAGCGCTAGATGANCAAGACACAATTGATATGCAGTCTTAGGAANCTAGAGAGAGANTGT 
AGANTANGGTGATGAACGCACNTNGG 

SEQIDN078 4 

TATTTNCCTGCGTGACCTAGTAAANATNGATAGGCCTCNANAGGTGGGGTTANTNAGGNCTC 
ATCAATNCCNAGACCCAAATCAGGCAATC 

SEQIDN07 85 

AAGCNGANNGACCTGTNTTGCACCTNAATATCCNNAGCCAAGGAAGANNGACGNTGGCTGGA 
TGANNNCAATNCTTNNANNAACCANNTACTGNCCN 

PRIMERS 

SEQIDN07 8 6 
CTCGTAGACTGCGTAGT 

SEQIDN07 87 
GATCACTACGCAGTCTAC 

SEQIDN07 8 8 

G AC GAT G A G T C C T GAG 

SEQIDN07 8 9 
TACTCAGGACTCAT 

SEQIDNO7 90 
GACTGCGTAGTGATCNNN 

SEQIDN0791 
GATGAGTCCTGAGTAANN 
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